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TRANSMITTAL LETTER TO THE UNITED STATES DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 USC 371 

International Application No.: PCT/BE9 8/00 124 
International Filing Date: August 20, 1998 

Priority Date Claimed: August 20, 1997 

Title of Invention: PEROXISOME-ASSOCIATED POLYPEPTIDE, NUCLEOTIDE SEQUENCE 

ENCODING SAID POLYPEPTIDES AND THEIR USES IN THE DUGNOSIS AND/OR THE TREATMENT OF 
LUNG INJURIES AND DISEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS 
Applicant(s) for DO/EO/US: Bernard Knoops, Cedric Hermans, Alfred Bernard, Ruddy Wattiez, 

Paul Falmagne 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and 
other information: 

1 . (X) This is a FIRST submission of items concerning a filing under 35 USC 37 1 , 

2. (X) This express request to begin national examination procedures (35 USC 371(f)) at any time rather 

than delay examination until the expiration of the applicable time hmit set in 35 USC 371(b) and 
PCT Articles 22 and 39(1). 

3. (X) A proper Demand for International Preliminary Examination was made by the 19th month from 

the earliest claimed priority date. 

4. (X) A copy of the International Application as filed (35 USC 37 1(c)(2)) 

a) 0 is transmitted herewith (required only if not transmitted by the International Bureau). 

b) (X) has been transmitted by the International Bureau. 

c) 0 is not required, as the application was filed in the United States Receiving Office 

(RO/US). 

5. (X) Amendments to the claims of the International Application under PCT Article 19 (35 USC 

371(c)(3)) 

a) 0 are transmitted herewith (required only if not transmitted by the Intemational Bureau). 

b) 0 have been transmitted by the Intemational Bureau. 

c) 0 have not been made; however, the time limit for making such amendments has NOT 

expired. 

d) (X) have not been made and will not be made. 

6. (X) A copy of the Intemational Prelhninary Examination Report with any annexes thereto, such as any 

amendments made under PCT Article 34. 

7. (X) An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98. 

8. (X) A FIRST preliminary amendment. 

9. (X) Intemational Application as published. 

1 0. (X) PCT Form PCT/IPEA/402. 

11. (X) PCT Form PCT/IB/308. 
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12. (X) PCX request form. 

13. (X) A return prepaid postcard. 

14. (X) The following fees are submitted: 



FEES 





BASIC FEE 






$840 


CLAIMS 


NUMBER 
FILED 


NUMBER 
EXTRA 


RATE 




Total Claims 


30 -20 = 


10 X 


$18 


$180 


Independent Claims 


1 - 3 = 


0 X 


$78 


$0 


Multiple dependent claims(s) (if applicable) 




$260 


$260 




TOTAL OF ABOVE CALCULATIONS 


$1280 






TOTAL FEES ENCLOSED 




$840 



The fee for later submission of the signed oath or declaration set forth in 37 CFR 1.492(e) will be 
paid upon submission of the declaration. 



15. 

16. 
17. 



(X) 

(X) 
(X) 



A check in the amount of $840 to cover the above fees is enclosed. 

The Commissioner is hereby authorized to charge only those additional fees which may be 
required, now or in the future, to avoid abandonment of the application, or credit any 
overpayment to Deposit Account No. 1 1-1410. A duplicate copy of this sheet is enclosed. 

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive 
(37 CFR L137(a) or (b)) must be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO: 

KNOBBE, MARTENS, OLSON & BEAR, LLP 

620 Newport Center Drive 

Sixteenth Floor 

Newport Beach, CA 92660 




Signature 
Daniel E. Altman 
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I hereby certify that this correspondence and all marked 
attachments are being deposited with the United States Postal 
Service as first class mail in an envelope addressed to: 
Commissioner of Patents and Trademarks, Washington, D.C. 
20231, on 
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Daniel E. Altman, Reg. No. 34,115 



TRANSMITTAL LETTER TO THE UNITED STATES DESIGNATED/ELECTED 
OFFICE (DO/EO/US) CONCERNING A FILING UNDER 35 USC 371 

International Application No.: PCT/BE98/00124 
International Filing Date: August 20, 1998 
Priority Date Claimed: August 20, 1997 

Title of Invention: PEROXISOME- ASSOCIATED POLYPEPTIDE, NUCLEOTIDE 

SEQUENCE ENCODING SAID POLYPEPTIDE AND THEIR USES IN 
THE DIAGNOSIS AND/OR TREATMENT OF LUNG INJURIES AND 
DISEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS 

Applicant(s) for DO/EO/US: Knoops et al. 



Applicant herewith submits to the United States Designated/Elected OflBce (DO/EO/US) the 
following items and other information: 

(X) This is a SECOND or SUBSEQUENT submission of items concerning a filing under 3 5 
U.S.C.371. 

(X) Copy of Notification of Missing Requirements Under 35 U.S.C. 371 In The United States 
Designated/Elected Office (DO/EO/US) dated May 4, 2000. 

(X) An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

(X) An extension of time to respond for two month(s) is hereby requested. 

Time Extension Fee: 

(X) two months ($190 small entity) 

08/18/2000 ERififlNDQ 00000139 09486167 
(X) Small Entity Statement. , --*t-ECj2i5 „ . 55^00-OP 

(X) A return prepaid postcard. 

(X) The fee of $65 for submission of the Declaration after 30 months from the priority under 
37 C.F.R. 1.492(e). 
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(X) The fee of $65 for submission of the Declaration after 30 months from the priority under 
37C.F.R. 1.492(e). 



(X) Fees as calculated below: 



FILING FEE PAID ON FEBRUARY 22, 2000 $ 


840 




FEE FOR EXTENSION OF TIME (LARGE ENTITY) 2 months $ 


380 




SURCHARGE 37 CFR 1.16(e) $ + 130 


REMAINDER OF FILING FEE TO BE PAID $ 


440 




TOTAL OF ABOVE CALCULATIONS $ 


1790 




REDUCTION BY 1/2 FOR FILING BY SMALL ENTITY. 
Note 37 CFR 1.9, 1.27, 1.28. If applicable, verified statement must be attached. $ • 


- 895 




TOTAL OF $895 SUBTRACT $840 ALREADY PAID = $ 


55 




TOTAL FEES SUBMITTED HEREWITH $ 


55 





(X) The Commissioner is hereby authorized to charge any additional fees which may be 

required, or credit any overpayment to Deposit Account No. 11-1410. A dupUcate copy 
of this sheet is enclosed. 



SEND ALL CORRESPONDENCE TO: 





KNOBBE, MARTENS, OLSON & BEAR, LLP 

620 Newport Center Drive 

Sixteenth Floor 

Newport Beach, CA 92660 



Signature 




Daniel E. Altman 




Printed Name 




34,115 
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Applicants: Knoops et al. Attorney's Docket No.: VANM143 OOIAPC 

Int'l. Application No.: PCT/BE98/00124 p ^ 

Int'l. Filed: August 20, 1998 
^: PEROXISOME-ASSOCIATED POLYPEPTIDE, NUCLEOTIDE SEQUENCE ENCODING SAID 
#^^LYPEPTIDE AND THEIR USES IN THE DIAGNOSIS AND/OR TREATMENT OF LUNG INJURIES 
ANI\MSEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS 



VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL-ENTITY STATUS 
I, the undersigned, do hereby declare that: 

[X] I am an official of the small business concern empowered to act on behalf of the concern identified below: 
NAME OF CONCERN: UNIVERSITE CATHOLIQUE DE LOUVAIN 

ADDRESS OF CONCERN: Halles Universitaires, Place de I'Universite 1, B-1348 Louvain-La-Neuve 
BELGIUM 

I further declare that the above-identified smaU business concern qualifies as a small business concern as 
defined in 13 CFR 121.12, and reproduced in 37 CFR 1.9(d), for purposes of paying reduced fees to the 
Umted States Patent and Trademark Office, m that the number of employees of the concem, including 
those of its affiliates, does not exceed 500 persons. For purposes of this statement, (1) the number of 
employees of the business concem is the average over the previous fiscal year of the concem of the persons 
employed on a fiill-time, part-time or temporary basis during each of the pay periods of the fiscal year, and 
(2) concerns are affiliates of each other when either, directly or indirectly, one concem controls or has the 
power to control the other, or a third party or parties controls or has the power to control both. I further 
declare that rights under contract or law have been conveyed to and remain with the small business concem 
identified above with regard to the invention described in the patent or application identified above. 

The individual, concem or organization identified above has not assigned, granted, conveyed or licensed, and is 
under no obligation under contract or law to assign, grant, convey or license, any rights in the invention to any 
person who would not qualify as an independent inventor under 37 CFR 1.9(c) if that person had made the 
invention, or to any concem which would not qualify as a small business concem under 37 CFR 1 9(d) or a 
nonprofit organization under 37 CFR 1 .9(e). 

If the rights held by the above-identified individual, concem or organization are not exclusive, each individual 
concem or organization having rights in the mvention are identified below. Each such individual, concem or 
orgamzation must file separate verified statements averring to their status as small entities. 

*NOTE: Separate verified statements are required from each named person, concern or organization having 
rights to the invention averring to their status as small entities. (37 CFR 1.27). 

FULL NAME: UNIVERSITE DE MONS-HAINAUT 
ADDRESS: Place du Pare 20, B-7000 Mons, BELGIUM 

[ ] INDIVIDUAL [X] SMALL BUSINESS CONCERN [ ] NONPROFIT ORGANIZATION 

I acknowledge the duty to file, in this application or patent, notification of any change in status resulting in loss of 
entitlement to small-entity status prior to paying, or at the time of paying, the earUest of the issue fee or any 
maintenance fee due after the date on which status as a small entity is no longer appropriate. (37 CFR 1 .28(b)). 

I hereby declare that all statements made herein of my own knowledge are trae and that all statements made on 
information and behef are believed to be trae; and further that these statements were made with the knowledge that 
willful false statements and the hke so made are punishable by fine or imprisonment, or both, under section 1001 of 
Title 18 of the United States Code, and that such willful false statements may jeopardize the validity of the 
application, any patent issuing thereon, or any patent to which this verified statement is directed. 



Applicants: Knoopsetal. Attorney's Docket No.: VANM143.001APC 

Int'l. Application No.: PCT/BE9 8/00 124 Page 2 

Int'l. Filed: August 20, 1998 

For: PEROXISOME-ASSOCIATED POLYPEPTIDE, NUCLEOTIDE SEQUENCE ENCODING SAID 
POLYPEPTIDE AND THEIR USES IN THE DIAGNOSIS AND/OR TREATMENT OF LUNG INJURIES 
AND DISEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS 



NAME OF PERSON SIGNESFG: ^ O O r( J~» MARCEL CROCHET 

TITLE OF PERSON (if not an owner or individual): e^<n^ Rector 

ADDRESS OF PERSON SIGNING: Halles Universitaires, Place de TUniversite 1, B-1348 Louvain-La-Neuve, 

BELGIUM 



SIGNATURE: 




DATE: 
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PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicant 
Int'l 

Appl. No. 

Int'l Filing 
Date 

For 



Examiner 



Knoops, et al. 



PCT/BE98/00124 



August 20, 1998 

PEROXISOME-ASSOCIATED 
POLYPEPTIDE, 
NUCLEOTIDE SEQUENCE 
ENCODING SAID 
POLYPEPTIDE AND THEIR 
USES IN THE DIAGNOSIS 
AND/OR TREATMENT OF 
LUNG INJURIES AND 
DISEASES, AND OF 
OXIDATIVE STRESS- 
RELATED DISORDERS 

Unknown 



Group Art Unit Unknown 



PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Dear Sir: 

Preliminary to Examination on the merits, please amend the above-captioned patent 
application as follows: 
IN THE SPECIFICATION 

On page 1 of the Specification after the Title of the Invention and before the Field of the 
Invention, on line 13, please insert —This U.S. National Phase application claims priority under 
35 U.S.C. §371 of International Application PCT/BE98/00124, filed August 20, 1998, which 
claims priority of Belgian apphcation BE 9700692, filed August 20, 1997.-. 
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InVl AppL No. : PCT/BE98/00124 
Int'l Filing Date : August 20, 1998 

On page 21, before Claim 1, please cancel the word "CLAIMS" and substitute therefore 

--WHAT IS CLAIMED IS:--. 

IN THE CLAIMS 

1. (Amended) An isolated or purified [ Amino acid sequencel polvpeptide 
[having] comprising and amino acid sequence more than 70% [homology withi homologous to 
[the sequence] SEQ ID [NO 21 NO:2 . 

2. (Amended) [Amino acid sequencel The isolated or purified polypeptide 
according to claim 1, [having] more than 85% [homology with the sequencel homologous to 
SEQ ID [NO 21 NO:2 . 

3. (Amended) [Amino acid sequencel The isolated or purified polypeptide 
according to claim 1 [or 2], [having] more than 95% [homology with the] homologous to 
sequence SEQ ID [NO 2] NO:2 . 

4. (Amended) [Amino acid sequencel The isolated or purified polypeptide 
according to [any one of the preceding claimsl claim L [corresponding tol comprising SEQ ID 
[NO 2] NO:2 or an immunoreactive portion thereof 

5. (Amended) An isolated or purified polynucleotide [Nucleotide sequence] 
encoding the amino acid sequence according to [any one of the preceding claimsl claim 1 and 
[presenting] more than 70% [homology withl homologous to SEQ ID [NO ll NO:l or its 
complementary strand. 

6. (Amended) An isolated or purified polynucleotide [Nucleotide sequence] 
according to claim 5, [having] more than 85% [homo[ogy with the sequencel homologous to 
SEQ ID [NO 1] N0:1 or its complementary strand. 

7. (Amended) An isolated or purified polynucleotide [Nucleotide sequence] 
according to claim 5 more than 95%> [homology with the sequencel homologous to SEQ ID [NO 
1] NQ:1 or its complementary strand, 

8. (Amended) An isolated or purified polynucleotide [Nucleotide sequence] 
according to [any one of the claims 5 to 7, corresponding to the sequence] claim 5 comprising 
SEQ ID [NO ll NOiL its complementary strand or a portion thereof specific for SEQ ID [NO 
1] N0:1 and comprising more than 15 base pairs. 

9. (Amended) A[V]vector comprising the [nucleotide sequence according to 
any one of the] polynucleotide of claim[s] 5 [to 8]. 
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Filing Date 



PCT/BE98/00124 
August 20, 1998 

10. (Amended) A purified antibody or an active portion of said antibody [inhibitor 
direbted againstjthat specifically binds to the polypeptide [amino acid or nucleotide sequence 
acccrding to any one of the] of claim[s[ I [to 8]. 

11. (Amended) [Inhibitor according to claim 10, being an] The purified 
antiUody, [preferably] of claim 2 wherein said antibody is a monoclonal antibody [, or a portion 
of s^id antibody]. 

12. (Amended) A [D] diagnostic device comprising an element selected from the 
grout) consisting of the amino acid sequence [according to any one of the claims 1 to 4] of 
claint 1 , the nucleotide sequence [according to any one of the claims 5 to Sl of claim 2 , the 



[inhibitor according to claim 10 or 11] antibody of claim 10 , their portions [or] and a mixture 
therepf. 

13. (Amended) [Method] A method for the in vitro detection of lung injuries and 
diseabes or oxidative stress-related diseases and disorders, [especially inflammatory 
diseases,] comprising the steps of: 

-isolating a sample from a body fluid of a patient, [preferably a human patient,] 
[-possibly inhibiting the contaminants present in said sample,] 

-[put in] contacting said sample with an element selected from the group consisting of 
the aAiino acid sequence [according to any one of the claims 1 to 4] of claim 1 , the nucleotide 
sequence [according to any one of the claims 5 to 8] of claim 5 , the [inhibitor according to 
claimi 10 or 11] antibody of claim 1 Q , their portions [or] and a mixture thereof, and 
-detecting a reaction of a molecule present in said sample with said element. 

14. (Amended) [Pharmaceutical] A pharmaceutical composition comprising a 
phamiaceutically acceptable carrier and an element selected from the group consisting of the 
ammd acid sequence [according to any one of the claims 1 to 4] of claim 1 , the nucleotide 
seque ice [according to any one of the claims 5 to 8] of claim 5 , the [inhibitor according to 
claim 10 or 11] antibody of claim 10 , their portions [or] and a mixture thereof. 

15. (Amended) [Use of the pharmaceutical composition according to claim 14 
for tile manufacture of a medicament for the prevention and/or the treatment of lung 
injuries or diseases, and ofl The method of claim 13 wherein said oxidative stress-related 
diseases or disorders[, such as] are selected from the group consisting of: specific cardio-vascular 
diseases [like arteriosclerosis,] neiurodegenerative disorders [such as Alzheimer's disease, 

-3- 



Int'l Appl. No. : PCT/BE98/00124 
InVl Filing Date : August 20, 1998 

Parkinson^s disease, amyotrophic lateral sclerosis,] apoptosis and inflammatory reactions, 
allergic reactions [such as asthma, hay fever and eczema,] high bone mass sjmdrome, 
osteopetrosis, osteoporosis-pseudoglioma syndrome, and Bardet-Biedl syndrome 1. 

16. (Amended) [Cell] A cell transformed by the vector according to claim 9 or 
comprising a partial or total Renomic deletion of [its] SEO ID N0:1. or a homologue thereof 
[nucleotide sequence according to any one of the claims 5 to 8]. 

1 7. (Amended) [Non-human] A non-human transgenic animal, [ preferably a 
mammal] transformed by the vector according to claim 9 or comprising a partial or total 
genomic deletion of [its] SEP ID NO:L or a homologue thereof [nucleotide sequence according 
to any one of the claims 5 to 8]. 

Please add the following claims: 

18. The method of claim 13, wherein said oxidative stress-related diseases and 
disorders are inflammatory diseases. 

19. The method of claim 13, further comprising the step of inhibiting the 
contaminants present in said sample. 

20. The method of claim 13, wherein said patient is a human patient. 

21. The method of claim 15 wherein said specific cardio-vascular diseases is 
arteriosclerosis. 

22. The method of claim 15 wherein said neurodegenerative disorders are selected 
from the group consisting of: Alzheimer's disease, Parkinson's disease, and amyotrophic lateral 
sclerosis. 

23. The method of claim 15 wherein said allergic reactions are selected from the 
group consisting of: asthma, hay fever and eczema. 

24. The transgenic animal of claim 16, wherein said animal is a mammal. 

REMARKS 

Claims 1-17 have been amended to conform to U.S. practice before the USPTO. Claims 
18-24 have been added. Support for added Claims 18-24 can be found in the claims as filed. The 
Specification has been amended to include the priority international document and to correct 
minor informalities. No new matter has been added herewith. As a result of the amendments, 
Claim 1-24 are pending. 



^1 



i 



o 

CO 



Inti Appl. No. : PCT/BE98/00124 
Int'l Filing Date : August 20, 1998 

This Preliminary Amendment enters a Sequence listing, pages 1-13. Enclosed herewith 

are: (1) a paper copy of the Sequence Listing, (2) and a computer readable version of the 

Sequence Listing. In view of the foregoing, the application is believed to fiilly comply with the 

Sequence Listing disclosure requirements. 

VERIFICATIONS UNDER 37 C.F.R. §1.821 ffl & (g^ 

All of the sequences in the attached Sequence Listing were included in the application as 

filedl Pursuant to 37 C.F.R. § 1.821(g), no new matter is being added herewith. As required 

undelr 37 C.F.R. § 1.821(f), I hereby verify that the data on the computer readable disk and the 

pap el: copies of the Sequence Listing submitted herewith are identical. 

Con< elusion 



Should there be any questions concerning this application, the Examiner is invited to 
cont4ct the undersigned attorney at the telephone number appearing below. 

Respectfully submitted, 

KNOBBE, MARTENS, OLSON & BEAR, LLP 



Datec 




Daniel E. Altman 

Registration No. 34, 1 1 5 

Attorney of Record 

620 Newport Center Drive 

Sixteenth Floor 

Newport Beach, CA 92660 

(949) 760-0404 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: UNIVERSITE CATHOLIQUE DE LOUVAIN 

Halles Universi taires 

(B) STREET: Place de 1' Universite, 1 

(C) CITY: LOWAIN-LA-NEUVE 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP): B-1348 

(A') NAME: UNIVERSITE DE MONS-HAINAUT 

(B) STREET: Place du Pare 2 0 

(C) CITY: MONS 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODS (ZIP) : B-7000 

(11) TITLE OF INVENTION: PEROXISOME-ASSOCIATED PEPTIDE, NUCLEOTIDE 
SEQUENCE ENCODING SAID PEPTIDE AND THEIR USES IN THE 
DIAGNOSTIC AND/ OR THE TREATMENT OF LUNG INJURIES AND 
DISEASES, AND OF OXIDATIVE STRESS -RELATED DISORDERS 

(ill) NUMBER OF SEQUENCES: 19 

Civ) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.3 0 (EPO) 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 193 . .681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GCCAGGAGGC GGAGTGGAAG TGGCCGTGGG GCGGGTATGG GACTAGCTGG CGTGTGCGCC 6 0 



CTGAGACGCT CAGCGGGCTA TATACTCGTC GGTGGGGCCG GCGGTCAGTC TGCGGCAGCG 12 0 



GCAGCAAGAC GGTGCAGTGA AGGAGAGTGG GCGTCTGGCG GGGTCCGCAG TTTCAGCAGA 180 

GCCGCTGCAG CC ATG GCC CCA ATC AAG GTG GGA GAT GCC ATC CCA GCA 22 8 

Met Ala Pro lie Lys Val Gly Asp Ala lie Pro Ala 
15 10 

GTG GAG GTG TTT GAA GGG GAG CCA GGG AAC AAG GTG AAC CTG GCA GAG 27 6 

Val Glu Val Phe Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu 
15 20 25 

CTG TTC AAG GGC AAG AAG GGT C^TG CTG TTT GGA GTT CCT GGG GCC TTC 324 
Leu Phe Lys Gly Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe 
30 35 40 

ACC CCT GGA TGT TCC AAG ACA CAC CTG CCA GGG TTT GTG GAG CAG GCT 3 72 

Thr Pro Gly Cys Ser Lys Thr His Leu Pro Gly Phe Val Glu Gin Ala 
45 50 55 60 

GAG GCT CTG AAG GCC AAG GGA GTC CAG GTG GTG GCC TGT CTG AGT GTT 42 0 

Glu Ala Leu Lys Ala Lys Gly Val Gin Val Val Ala Cys Leu Ser Val 
65 70 75 

AAT GAT GCC TTT GTG ACT GGC GAG TGG GGC CGA GCC CAC AAG GCG GAA 468 
Asn Asp Ala Phe Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu 
80 85 90 

GGC AAG GTT CGG CTC CTG GCT GAT CCC ACT GGG GCC TTT GGG AAG GAG 516 
Gly Lys Val Arg Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu 
95 100 105 

ACA GAC TTA TTA CTA GAT GAT TCG CTG GTG TCC ATC TTT GGG AAT CGA 5 64 

Thr Asp Leu Leu Leu Asp Asp Ser Leu Val Ser lie Phe Gly Asn Arg 
110 115 120 

CGT CTC AAG AGG TTC TCC ATG GTG GTA CAG GAT GGC ATA GTG AAG GCC 612 
Arg Leu Lys Arg Phe Ser Met Val Val Gin Asp Gly lie Val Lys Ala 
125 130 135 140 

CTG AAT GTG GAA CCA GAT GGC ACA GGC CTC ACC TGC AGC CTG GCA CCC 660 
Leu Asn Val Glu Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro 
145 150 155 

AAT ATC ATC TCA CAG CTC TGA GGCCCTGGGC CAGATTACTT CCTCCACCCC 711 
Asn lie lie Ser Gin Leu * 
160 

TCCCTATCTC ACCTGCCCAG CCCTGTGCTG GGGCCCTGCA ATTGGAATGT TGGCCAGATT 771 

TCTGCAATAA ACACTTGTGG TTTGCGGAAA AAAA 80 5 



(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ala Pro lie 
1 

Glu Gly Glu Pro 
20 

Lys Lys Gly Val 
35 

Ser Lys Thr His 
50 

Ala Lys Gly Val 
65 

Val Thr Gly Glu 



Leu Leu Ala Asp 
100 

Leu Asp Asp Ser 
115 

Phe Ser Met Val 
130 

Pro Asp Gly Thr 
145 

Gin Leu * 




Lys Val Gly Asp 
5 

Gly Asn Lys Val 



Leu Phe Gly Val 
40 

Leu Pro Gly Phe 
55 * 

Gin Val Val Ala 
70 

Trp Gly Arg Ala 
85 

Pro Thr Gly Ala 



Leu Val Ser lie 
120 

Val Gin Asp Gly 
135 

Gly Leu Thr Cys 
150 



Ala lie Pro Ala 
10 

Asn Leu Ala Glu 
25 

Pro Gly Ala Phe 



Val Glu Gin Ala 
60 

Cys Leu Ser Val 
75 

His Lys Ala Glu 
90 

Phe Gly Lys Glu 
105 

Phe Gly Asn Arg 



lie Val Lys Ala 
140 

Ser Leu Ala Pro 
155 




Val Glu Val Phe 
15 

Leu Phe Lys Gly 
30 

Thr Pro Gly Cys 
45 

Glu Ala Leu Lys 



Asn Asp Ala Phe 
80 

Gly Lys Val Arg 
95 

Thr Asp Leu Leu 
110 

Arg Leu Lys Arg 
125 

Leu Asn Val Glu 



Asn lie lie Ser 
160 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 136 . .624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TGCGTCCTAG GCAGCATAGC CGGATCGGTG CTCCGTGCAT CGGCTACTTG GACGTGCGTG 



GCAGGCAGAG CAGGCCGGAA AGGAGCAGGT TGGGAGTGTG GTGGGGCCCG CAGCTTCAGC 



60 
120 




AGTGCCGCGG TGACTATGGC CCCGATCAAG GTGGGAGACA CCATTCCCTC AGTGGAGGTA 180 

TTTGRAGGGG AACCTGGAAA GAAGGTGAAC TTGGCAGAGC TGTTCAAGGA CAAGAAAGGT 240 

GTTTTGTTTG GAGTCCCTGG GGCATTTACA CCTGGCTGTT CCAAGACCCA TCTGCCTGGG 3 00 

TTTGTGGAGC AAGCCGGAGC TCYGAAGGCC AAGGGAGCAC AAGTGGTGGC CTGTCTGAGT 3 60 

GTTAATGATG YCTTCGTGAC TGCAGAGTGG GGTCGAGCCC ACCAGGCAGA AGGCAAGGTT 420 

CAGCTCCTGG CTGACCCCAC TGGAGCTTTT GGAAAGGAGA CAGATTTACT ACTAGATGAT 480 

TCTTTGGTGT CTCTCTTTGG GAATCGTCGG CTAAAAAGGT TCTCCATGGT GATAGACAAG 540 

GGCGTAGTAA AGGCACTGAA CGTGGAGCCG GATGGCACAG GCCTCACCTG CAGCCTGGCC 600 

CCCAACATCC TCTCACAACT CTGAGGCCCT GACCAGAATG TCCTCTGACT CTCCCATCTC 66 0 

CTCCACCCAG CTCTGGGCCA AAGGCCCAGT ACCTCCTTAC CTGAGGGCCA CTGGAATGGA 720 

ACCTTGACAA TATTTCTGCA ATAAACAGTT TAATTTGTGA AAAAAAAAAA AAAAAAAAAA 7 80 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

Cix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /product = "Glu or Gly" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 63 

(D) OTHER INFORMATION: /product = "Leu or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 

(B) LOCATION: 79 

(D) OTHER INFORMATION: /product= "Ala or Val" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 



Met Ala Pro lie Lys Val Gly Asp Thr lie Pro Ser Val Glu Val Phe 
15 10 15 




Xaa Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Asp 
20 25 30 

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys 
35 40 45 

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gin Ala Gly Ala Xaa Lys 
50 55 60 

Ala Lys Gly Ala Gin Val Val Ala Cys Leu Ser Val Asn Asp Xaa Phe 
65 70 ' 75 80 

Val Thr Ala Glu Trp Gly Arg Ala His Gin Ala Glu Gly Lys Val Gin 
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg 
115 120 125 

Phe Ser Met Val lie Asp Lys Gly Val Val Lys Ala Leu Asn Val Glu 
130 135 140 

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn He Leu Ser 



Gin Leu 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 99 . .588 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
TGCTCCGTGC ATCGACGTGC TTGGCAGGCA GAGCAGGCCG GAAAGAAGCA OGTTGGGAGT 
GTGGCGGAGC CCGCAGCTTC AGCAGCTCCG CGGTGACCAT GGCCCCGATC AAGGTGGGAG 
ATGCCATTCC CTCAGTGGAG GTATTTGAAG GGGAACCGGG AAAGAAGGTG AACTTGGCAG 
AGCTGTTCAA GGGCAAGAAA GGTGTTTTGT TTGGAGTCCC TGGGGCATTT ACACCTGGCT 



145 



150 



155 



160 



GTTCTAAGAC CCACCTGCCT GGGTTTGTGG AGCAAGCTGG AGCTCTGAAG GCTAAGGGAG 3 00 

CGCAGGTGGT GGCCTGTCTG AGCGTTAATG ACGTCTTTGT GATTGAAGAG TGGGGTCGAG 360 

CCCACCAGGC AGAAGGCAAG GTTCGGCTCC TGGCTGACCC CACTGGAGCC TTTGGGAAGG 42 0 

CGACAGACTT ATTATTGGAT GATTCTTTGG TGTCTCTCTT TGGGAATCGT CGGCTGAAAA 48 0 

GGTTCTCCAT GGTGATAGAC AACGGCATAG TGAAGGCACT GAACGTGGAG CCAGATGGCA 540 

CAGGCCTCAC CTGCAGCCTG GCCCCCAACA TCCTCTCCCA ACTCTGAGGC CCTGGCCAGA 60 0 

TGTCCTCTGA CtCTCCCATC TCTCCCACCC GGCTCTAGGC CAAAAGGCTC GGTACCTCCT 66 0 

TACTGGGAGC CACGT 675 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 



(xi) SEQUENCE DEi 

Met Ala Pro lie 
1 

Glu Gly Glu Pro 
20 

Lys Lys Gly Val 
35 

Ser Lys Thr His 
50 

Ala Lys Gly Ala 
65 

Val lie Glu Glu 



Leu Leu Ala Asp 
100 

Leu Asp Asp Ser 
115 



ICRIPTION: SEQ i; 

Lys Val Gly Asp 
5 

Gly Lys Lys Val 



Leu Phe Gly Val 
40 

Leu Pro Gly Phe 
55 

Gin Val Val Ala 
70 

Trp Gly Arg Ala 
85 

Pro Thr Gly Ala 



Leu Val Ser Leu 
120 



) NO: 6: 

Ala Tie Pro Ser 
10 

Asn Leu Ala Glu 
25 

Pro Gly Ala Phe 



Val Glu Gin Ala 
60 

Cys Leu Ser Val 
75 

His Gin Ala Glu 
90 

Phe Gly Lys Ala 
105 

Phe Gly Asn Arg 



Val Glu Val Phe 
15 

Leu Phe Lys Gly 
30 

Thr Pro Gly Cys 
45 

Gly Ala Leu Lys 



Asn Asp Val Phe 
80 

Gly Lys Val Arg 
95 

Thr Asp Leu Leu 
110 

Arg Leu Lys Arg 
125 




Phe Ser Met Val lie Asp Asn Gly lie Val Lys Ala Leu Asn Val Glu 
130 135 140 

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn lie Leu Ser 
145 150 155 160 

Gin Leu 



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) SEQUENCE CHARACTERISTICS: 

(X) LENGTH: 469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

{B) LOCATION: 161. .382 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 12 0 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTTCG 18 0 

GCTCCTGGCT GATCCCACTG GGGCCTTTGG GAAGGAGACA GACTTATTAC TAGATGATTC 240 

GCTGGTGTCC ATCTTTGGGA ATCGACGTCT CAAGAGGTTC TCCATGGTGG TACAGGATGG 3 00 

CATAGTGAAG GCCCTGAATG TGGAACCAGA TGGCACAGGC CTCACCTGCA GCCTGGCACC 3 60 

CAATATCATC TCACAGCTCT GAGGCCCTGG GCCAGATTAC TTCCTCCACC CCTCCCTATC 42 0 

TCACCTGCCC AGCCGTGTGC TGGGGCCCTG CAATTGGAAT GTTGGCCAG 469 
(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 161. ,514 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 12 0 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGACACA 180 

CCTGCCAGGG TTTGTGGAGC AGGCTGAGGC TCTGAAGGCC AAGGGAGTCC AGGTGGTGGC 240 

CTGTCTGAGT GTTAATGATG CCTTTGTGAC TGGCGAGTGG GGCCGAGCCC ACAAGGCGGA 3 00 

AGGCAAGGTT CGGCTCCTGG CTGATCCCAC TGGGGCCTTT GGGAAGGAGA CAGACTTATT 3 60 

ACTAGATGAT TCGCTGGTGT CCATCTTTGG GAATCGACGT CTCAAGAGGT TCTCCATGGT 42 0 

GGTACAGGAT GGCATAGTGA AGGCCCTGAA TGTGGAACCA GATGGCACAG GCCTCACCTG 480 

CAGCCTGGCA CCCAATATCA TCTCACAGCT CTGAGGCCCT GGGCCAGATT ACTTCCTCCA 540 

CCCCTCCCTA TCTCACCTGC CCAGCCCTGT GCTGGGGCCC TGCAATTGGA ATGTTGGCCA 600 

G 601 
(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 161 . . 517 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 




TGGGGCCGGC 


GGTCAGTCTG 


CGGCAGCGGC 


AGCAAGACGG 


TGCAGTGAAG 


GAGAGTGGGC 


120 


GTCTGGCGGG 


GTCCGCAGTT 


TCAGCAGAGC 


CGCTGCAGCC 


ATGGCCCCAA 


TCAAGGTGGG 


180 


AGATGCCATC 


CCAGCAGTGG 


AGGTGTTTGA 


AGGGGAGCCA 


GGGAACAAGG 


TGAACCTGGC 


240 


AGAGCTGTTC 


AAGGGCAAGA 


AGGGTGTGCT 


GTTTGGAGTT 


CCTGGGGCCT 


TCACCCCTGG 


300 


ATGTTCCAAG 


GTTCGGCTCC 


TGGCTGATCC 


CACTGGGGCC 


TTTGGGAAGG 


AGACAGACTT 


360 


ATTACTAGAT 


GATTC GCTGG 


TGTCCAT?CTT 


TGGGAATCGA 


CGTCTCAAGA 


GGTTCTCCAT 


420 


GGTGGTACAG 


GATGGCATAG 


TGAAGGCCCT 


GAATGTGGAA 


CCAGATGGCA 


CAGGCCTCAC 


480 


CTGCAGCCTG 


GCACCCAATA 


TCATCTCACA 


GCTCTGAGGC 


CCTGGGCCAG 


ATTACTTCCT 


540 


CCACCCCTCC 


CTATCTCACC 


TGCCCAGCCC 


TGTGCTGGGG 


CCCTGCAATT 


GGAATGTTGG 


600 


CCAG 












604 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

( A ) NAME / KEY : exon 

(B) LOCATION: 2516 . .2710 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2 074 . .213 5 

(ix) FEATURE: 

( A ) NAME / KEY : exon 

(B) LOCATION: 1932 . .1970 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 172 8 . .1859 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 802 . . 936 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 




TCTGTCCCTT AGCGCCCCCG CGGGGGCTTA CCCCATCCCA CTCCATGACC TCCCCTCCCC 6 0 

CCATGGCGAA TTCCCACCTT TCTGTCTTTC ACTCACTTCC TGGAACCGTC CCCAGGGCCT 120 
TGGACCTTCC CCCTTCTCCT CCCAAACCTT GTGAGACCCC ATTCCCTTTC TACTTCATCC 180 
TGCTCTCAAC TTTTGGGCTC CTCAGAGGCC CTCACCCCTG ACTCTCTCTC CCTACCCACT 240 
CTGGTCCCAT GAAGCCCTCA AGTACTCTGG GGATGGATCC TTCCCCCTTC Ai^J\AGATTCC 3 00 

TTCTTTTGTT CTACACCTCC TGGGTGTAGG GGCCTGGACA CCCTCCCCCA ACGTTCCACC 3 60 

TGCCGCTGCC CTTCCTCTTC CTCCTCCTGA GGGTGGGACC CTCAGACCTG GCCAAGATCC 42 0 

TCTCCCTCCA TGTTGTCAGG GACTCCTCCT CACCCCCAAA TACAGCCCTC TAGCCCCTGT 480 
CCATTTTATT CCACTCCTTT CCTGTAACCT AGACAGCATG TTATGCAACC CTTTGCGACA 540 
CATGGGGAAA CCTTCCCTCC CTTCCTCTGT TGTCACCAAT GGCCCCTTAA GAGGAGCAGG 6 00 

GCCACCTTGA AACTTGGAGG ATATGGGGTA ACCCAGTGGG AGCGGGCAGG GAGGGCCCTT 660 
GGAAACTGAC AGGGCTGGAG TATCCTGCTG GGTTTCAGCC CCGGTTCCTG CAGGCACAGC 72 0 

TGCCAGGCTC TCTGTTCACC TTCCTGCCTC TGGTTTGCCC CGGCTCCCTC ACCCCCCTTA 7 80 

CCCTGGAGTC CTTCCTTCTA GGTGGGAGAT GCCATCCCAG CAGTGGAGGT GTTTGAAGGG 840 
GAGCCAGGGA ACAAGGTGAA CCTGGCAGAG CTGTTCAAGG GCAAGAAGGG TGTGCTGTTT 90 0 

GGAGTTCCTG GGGCCTTCAC CCCTGGATGT TCCAAGGTGA GGCCCTTCCC CTTCTGAAGA 960 

TCAGGACCTG GGGATCTTTT GTGTTGCTCT TAAGTCCTCC ACATAGTCCT GATAGGACTC 102 0 

CTAAAAAGCA TTTCAGTGCC ATCACAAAAC AAGTAGAGCT GGGTAGAGCT GGGCGCGGTG 108 0 

GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCAAGGCG GGTGGATCAC GAGGTCAGGA 1140 

GTCCAAAACC AGCCTGGCCA AGATGGTGAA ACCCTGTCTC TACTAAAAAT GCAAAAAAAT 1200 

CAGCCGGATA TGGTGGCGGG CGCCTGTAAT CCCAGGTATT GGGGAGGCTG AGGCAGAGAA 12 60 

TTGCTTGAAC CCAGGAGGCG TAGGTTGCAG TGAGTGGAGA TCGTGCCTCT GCAGTCCAGC 1320 

CTGGGTGAAA GAGCGAGACT CCGTCTCAAA ATGAAAAAAA AAAAAGAAAA CAAGTAGAGA 13 8 0 

CTGCAAAAAG GGAACAGTAC CGGGAATGTT GGAGAAAAAC ATACTACAAT TAAATCCAAC 1440 

ACCCCTGTTG GTCCTGCTAA ATGACAGGCA CTGTGGAAGG TGCTTGGGAC TCAGATAAAT 1500 

AAGACAAAGA TCTGCCCATG GAAAGTTCAC GTCTGGACCA TAAGGCATTA GGTTTCATTC 15 60 

TGAGCTTCCT AGTGGCCAAG GCAAAAAGGA AATAGAATGG TTTAGACAGC TCTCATTGTC 152 0 

TGATCAAAGG TGTTGAGGCA GAGCACTGAG GAGGGCCTGG AGATAAAGGG TGGGCTGGGG 168 0 

GTCAGATGCA GTTATCCCTT TGCCGACCCT TTGTTCCCCT TCCTCAGACA CACCTGCCAG 174 0 

GGTTTGTGGA GCAGGCTGAG GCTCTGAAGG CCAAGGGAGT CCAGGTGGTG GCCTGTCTGA 1800 

GTGTTAATGA TGCCTTTGTG ACTGGCGAGT GGGGCCQAGC CCACAAGGCG GAAGGCAAGG 18 6 0 




TGAGGTGAGG GGCCTGCAGG GAGTCAGGAC CAGGTAGGAT ATTCTTCTTG TGACCTCTAC 1920 

TTTCTCTGCA GGTTCGGCTC CTGGCTGATC CCACTGGGGC CTTTGGGAAG GTGAGTGTTC 198 0 

CCCTGACCGC CACAGGGACA TGGCGGTGCG GGGAGCAGTG GGGGCCCTTG GCCTCTTCAA 2 04 0 

GGATTTCTGA CACTTTTCTC TGTCTCTTCT TAGGAGACAG ACTTATTACT AGATGATTCG 2100 

CTGGTGTCCA TCTTTGGGAA TCGACGTCTC AAGAGGTAAA AGTGGAGAGT CCTCTGTGGA 2160 

GAAAGTCCTC TGTGGGAGAG AGTCCTCTGT GGGAGAGAGT CCTCTGTGGA GAGGGTCCTC 222 0 

TGTGGGAAGA GTCGTCTGTG GGGGAGATGT GTGGGAGAGA GTCCTGTGTG GGGAGAGTCT 2280 

TCTGTAGGGG AGAGTCCTCT GGGGAGAGAG TCCTGTGTGG GGGAGAGTCC TCTGTGGGGA 2340 

GAGTCCTCTG TGTGGAGAGA GTCCTGTGTG GTGGTGAGTC CTCTGTGGGG GAGAGTCCTC 240 0 

TGTGGGGGGA GTCCTCTCTG GAGTTCTCTT GGGCCCCTGG CTGTTCACTG CCTGTCTCCA 2460 

TGCCCAGCCT CCAJ^^GCCCAG GCTGATGCAG CTGGCTGGGC CCCTCTTTCC GGCAGGTTCT 2 52 0 

CCATGGTGGT ACAGGATGGC ATAGTGAAGG CCCTGAATGT GGAACCAGAT GGCACAGGCC 2580 

TCACCTGCAG CCTGGCACCC AATATCATCT CACAGCTCTG AGGCCCTGGG CCAGATTACT 2 640 

TCCTCCACCC CTCCCTATCT CACCTGCCCA GCCCTGTGCT GGGGCCCTGC AATTGGAATG 270 0 

TTGGCCAGAT 2 710 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCATCCCAG CAGTGGAGGT GTTTG 2 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 





(xi) SEQUENCE DESCRIPTION 



SEQ ID NO: 12 : 



TTGAACAGCT CTGCCAGGTT CACC 



24 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGAGGTGTT TGAAGGGGAG CCAG 24 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAGGTTCACC TTGTTCCCTG GCTC 24 
{2} INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGGTATGGGA CTAGCTGGCG 2 0 

(2) INFORMATION FOR SEQ ID NO : 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 




• 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CTGGCCAACA TTCCAATTGC AG 



22 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGTTATGCA ACCCTTTGCG ACAC 24 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTGTTTGAAG GGGAGCCAGG GAAC 24 
(2) INFORMATION FOR SEQ ID NO : 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGAGACAGGG TTTCACCATC TTGG 24 



09/486167 
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PERQXISOME-ASSOCIATED POLYPEPTIDE/ NUCLEOTIDE SSOUSNCE 
10 ENCODING SAID POLYPEPTIDE AND THEIR USES IN THE DIAGNOSIS 
AND/OR THE TREATMENT OF LUNG INJURIES AND DISEASES. AND OF 
OXIDATIVE STRESS -RELATED DISORDERS 

Field of the invention 

15 The present invention is related to a new 

peroxisome-associated polypeptide, the nucleotide sequence 
encoding said polypeptide and portions thereof as well as 
their uses in the diagnosis of several diseases, especially 
the diagnosis and/or the treatment of lung injuries and 

20 diseases, and of oxidative stress-related disorders. 

Background of the invention 

The peroxisomes are organelles nearly 
ubiquitous in eukaryotic cells. They contain enzymes 

25 essential for various catabolic and anabolic pathways. Some 
of these enzymes are expressed constitutively while others 
can be induced under appropriate conditions. Peroxisomes 
carry out a variety of essential reactions such as 
peroxisomal oxidation and respiration, fatty acid beta- 

30 oxidation, cholesterol and doiichol metabolism, ether- 
phospholipid synthesis, and glyoxylate and pipecolic acid 
metabolism. 



wo 99/09054 PCT/BE98/00124 



The peroxisomal respiratory pathway is based 
upon the formation of hydrogen peroxide by a collection of 
oxidases and the decomposition of the H2O2 by catalase. 
These reactions ' are responsible for 2 0% of oxygen 
5 consumption in liver, and several oxidases have been 
identified in peroxisomes. Ethanol elimination via catalase 
in peroxisomes may be significant in addition to the 
oxidation via cytosolic alcohol dehydrogenase. 

The peroxisomal beta-oxidation system 

10 catalyses the beta-oxidat ive chain shortening of a specific 
set of compounds which can not be handled by mitochondria : 
very long chain fatty acids, di- and trihydroxycholestanoic 
acids, pristanic acid, long chain dicarboxylic acids, 
several prostaglandins, several leukotrienes, 12- and 15- 

15 hydroxyeicosatetraeonic acid, and several mono- and 
polyunsaturated fatty acids, which are of direct diagnostic 
relevance for some peroxisomal disorders. 

Peroxisomes play also a major role in the 
synthesis of cholesterol and other isoprenoids. Fibroblasts 

20 from patients affected by disorders of peroxisome 
biogenesis show low capacity to synthesise cholesterol. 

Two ^ enzyme activities responsible for 
introduction of the characteristic ether linkage in ether- 
linked phospholipids (dihydroacetonephosphate 

25 acyltransf erase (DHAPAT) and alkyldihydroxyacetonephosphate 
synthase (alkyl-DHAP synthase) ) are localised in 
peroxisomes. These enzymes are not yet cloned. As 
demonstrated by the identification of patients with 
deficiency of either DHAPAT or alkyl-DHAP synthase with 

30 severe clinical abnormalities, ether-phospholipids are of 
major importance in humans. 



wo 99/09054 ' PCT/BE98/00124 



Peroxisomes are able to detoxify glyoxylate 
via alanine/glyoxylate aminotransferase. The deficiency of 
this cloned enzyme causes hyperoxaluria type I. 
L-pipecolate is a minor metabolite of L-lysine and is 
5 catabolised by the L-pipecolate oxidase localised in 
peroxisomes. The enzyme is deficient in cerebro-hepato- 
renal (Zellweger) syndrome. 

In human, the importance of peroxisomes was 
emphasised by a number of inherited diseases involving 
10 either a defect in the biogenesis of peroxisomes or a 
deficiency of one (or more) peroxisomal enzymes. So far, 12 
different peroxisomal disorders have been described and 
most of them are lethal . 

A wide variety of chemicals have been showed 
15 to produce peroxisome proliferation and induction of 
peroxisomal and microsomal fatty acids-oxidising enzymes 
activities in rats and mice. Several peroxisomes 
proliferators have been shown to increase the incidence of 
liver tumours in these species. Proposed mechanisms of 
20 liver tumour formation by peroxisomes proliferators include 
induction of sustained oxidative stress. 

Therefore, newly identified molecules 
associated with peroxisomes could be used for the 
development of diagnostic tools and possibly for the 
25 improvement of several therapeutical applications of 
various diseases associated with peroxisomal disorders. In 
addition, it is useful to identify the molecules present in 
specific organs like the lung and which may be used as 
specific markers of inflammatory diseases as well as lung 
30 injuries or diseases. 
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Summary of the invention 

The Inventors have isolated and purified a 
new sequence of a low molecular weight human broncho- 
alveolar polypeptide . Said mammal , preferably human, 
5 protein or polypeptide (hereafter identified as B18hum 
protein) has been sequenced and its corresponding genomic 
DNA (SEQ ID NO 8) and cDNA (SEQ ID NO 1) have been 
identified. Similarly, the corresponding nucleotide and 
amino acid sequence from a rat (SEQ ID NO 3 and 4) and from 
10 a mouse (SEQ ID NO 5 and 6) have been obtained. 

Said sequences present several homologies 
with other peroxisomal proteins of yeast and comprise a 
carboxy- terminal tripeptide SQL which is necessary for the 
specific targeting and translocation of several proteins 
15 into the peroxisome. 

Therefore, the present invention is related 
to a new isolated and purified polypeptide sequence having 
a amino acid sequence which presents more than 7 0% 
homology, advantageously more than 85% homology, more 
20 preferably more than 95% homology, with the amino acid 
sequence SEQ ID NO 2 .j Said amino acid sequence is 
advantageously obtained from a mammal, preferably from a 
rat, a mouse or a human. 

The present invention is also related to the 
25 isolated and purified polypeptide sequence corresponding to 
the amino acid sequence SEQ ID NO 2 or a portion thereof, 
preferably an immunoreactive portion (putative immunogenic 
domain or T or B cell epitopes) . 

Said portions are advantageously comprised 

30 between : 

- Glutamic acid position 13 - Glutamic acid position 27 

- Alanine position 26 - Leucine position 36 
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- Alanine position 42 - Glutamic acid position 57 

- Glutamic acid position 57 - Valine position 69 
Valine position 80 - Leucine position 97 

- Arginine position 95 - Leucine position 112 
5 - Serine position 118 - Serine position 129 

Valine position 137 - Threonine position 150 

Preferably, said portion has more than 10, 
20, 30, 50 or 70 amino acids. Specific portions of the 
amino acid sequence SEQ ID NO 2 are also portions of more 

10 than 70 amino acids which present at least 80% of the 
proteinic activity (see example 5) of the complete SEQ ID 
NO 2 sequence. Therefore, the amino acid sequence according 
to the invention can be partially deleted while maintaining 
its activity, preferably its ant i -oxidative activity, which 

15 will be described hereafter. 

According to the invention, the amino acid 
sequence SEQ ID NO 2 presents a pi of 7 . 16 and a molecular 
weight of 17047 Dalton as hereafter defined by 
bidimensional electrophoresis. 

20 The present invention is also related to the 

nucleotide sequence enc/oding the amino acid sequence 
according to the -invention and its regulatory sequences 
upstream said coding sequence. A nucleotide sequence 
encoding the polypeptide according to the invention is a 

25 genomic DNA (see SEQ ID NO 10) , a cDNA (see SEQ ID NO 1) or 
a mRNA, possibly comprising said upstream regulatory 
sequence. Advantageously, said nucleotide sequence presents 
more than 70%, advantageously more than 85%, more 
preferably more than 95% homology with SEQ ID NO 1 or its 

30 complementary strand. 
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According to a preferred embodiment of the 
present invention, said nucleotide sequence corresponds to 
the nucleotide sequence SEQ ID NO 1, its complementary 
strand or a portion thereof. 
5 "A portion of the nucleotide sequence SEQ ID 

NO 1" means any nucleotide sequence of more than 15 base 
pairs (such as a primer, a probe or an antisense nucleotide 
sequence) which allow the specific identification, 
reconstitution or blocking of the complete nucleotide 

10 sequence SEQ ID NO 1 (including its regulatory sequences 
upstream the coding sequence) . 

Said portions allow the specific 
identification, reconstitution or blocking by specific 
hybridisation with the nucleotidic sequence SEQ ID NO 1, 

15 preferably under standard stringent conditions, with 
sequences like probes or primers possibly labelled with a 
compound (radioactive compound, enzyme, fluorescent marker, 
etc.), and can be used in a specific diagnostic or dosage 
method like probe hybridisation (see Sambrook et al . , §§ 

20 9.47-9.51 in Molecular Cloning : A Laboratory Manual , Cold 
Spring Harbor, Laboratory Press, Cold Spring Harbor, New 
York (1989)), genetic amplification (like PCR (US patent 
4,683,195), LCR (Wu et al . , Genomics 4, pp. 560-569), CPR 
(US patent 5,011,769)). 

25 Exemplary stringent hybridisation conditions 

are as follows : hybridisation at 42 ^C in 50% formamide 5x 
SSC, 2 0 mM sodium phosphate, pH 6.8 washing in 0 . 2x SSC at 

55 ^C. It is understood by those skilled in the art that 
variation of these conditions occur based on the length and 
3 0 GC nucleotide content of the sequence to be hybridised. 
Formulas standard in the art are appropriated for 
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determining exact hybridisation conditions (see Sambrook et 
al. 

Preferred examples of said nucleotide 
portions are as follows : 
5 Sequence Position 

5 ' -gccatcccagcagtggaggtgtttg-3 ' (SEQ ID NO 11) 217-241 
5 ' -ttgaacagctctgccaggttcacc-3 ' (SEQ ID NO 12) 261-284 

5' -tggaggtgtttgaaggggagccag-3' (SEQ ID NO 13) 230-253 

5' -caggttcaccttgttccctggctc-3 ' (SEQ ID NO 14) 247-270 

10 5' -gggtatgggactagctggcg-3 ' (SEQ ID NO 15) 33-52 

5' -ctggccaacattccaatt:gcag-3 ' (SEQ ID NO 16) 747-768 

and the sequences of respectively 601 (SEQ ID NO 8) , 604 
( SEQ ID NO 9 ) and 4 6 9 ( SEQ ID NO 7 ) base pairs 
corresponding to specific mRNA alternative splicing of the 

15 BIB human nucleotide sequence as described in Figure 4 (the 
known genomic sequence incorporating several introns and 
exons is represented in the sequence SEQ ID NO 10) . 

Said sequences may be used for a genetic 
amplification or a probe hybridisation as above-described. 

20 The present invention is also related to a 

vector comprising the necpssary elements for the injection, 
transfection or transduction of cells and having 
incorporated one or more of the nucleotide sequences 
according to the invention. The vector according to the 

25 invention is selected from the group consisting of viruses, 
plasmids, phagemides, cat ionic vesicles, liposomes or a 
mixture thereof* Said vector may comprise also one or more 
adjacent regulatory sequences (such as promoter (s), 
secretion and termination signal sequence (s) ) , 

3 0 advantageously operably linked to the nucleotide sequence 
according to the invention. 
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The present invention is also related to the 



cell transformed by said vector and expressing the 
polypeptide according to the invention. 



5 invention can be also introduced in said cell by the 
formation of CaP04 -nucleic acid precipitate, DEAE-dextran- 

nucleic acid complex or by electroporation. 



related to an inhibitor of the polypeptide according to the 
10 invention or the nucleotide sequence according to the 
invention (including the upstream sequences like promoter- 
operator regulatory sequence which may be inhibited by a 
cis- and/or transactivating repressor) . Said inhibitor is 
advantageously an antibody or a fragment of said antibody 
15 such as an hypervariable portion of said antibody directed 
against the amino acid or nucleotide sequence of the 
polypeptide according to the invention. Other examples of 
inhibitors according to the invention are antisense 
nucleotide sequences which allow the blocking of the 
2 0 expression of the nucleotide sequence according to the 
invention. i 



related to a diagnostic device (such as a diagnostic kit or 
a chromatographic column) comprising an element selected 

25 from the group consisting of the amino acid sequence of 
said polypeptide, its nucleotide sequence, and/ or the 
inhibitor according to the invention or a fragment thereof 
as above-described. Said diagnostic device may comprise 
also necessary reactants and media for the diagnostic 

30 and/or dosage of the nucleotide and/or amino acid sequence 
of the polypeptide according to the invention, which are 
based upon the method selected from the group consisting of 



The nucleotide sequence according to the 



Another aspect of the present invention is 



Another aspect of the present invention is 
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in situ hybridisation, hybridisation by labelled 
antibodies, especially RIA (Radio Immuno Assay) or ELISA 
(Enzymes Linked Immuno -S or bent Assay) technologies, 
detection upon filter, upon solid support, in solution, in 
5 sandwich, upon gel, dot blot hybridisation, Northern blot 
hybridisation, Southern blot hybridisation, isotopic or 
non-isotopic labelling (by immunofluorescence or 
biotinilised probes) , genetic amplification, (especially by 
PGR or LCR) , double immunodiffusion technique, counter- 
10 electrophoresis technique, haemagglutination or a mixture 
thereof . 

Another aspect of the present invention 
concerns a diagnosis method wherein a biological sample 
from the patient, such as cephalo-rachidian fluid, serum, 

15 blood, plasma, urine, broncho- alveolar lavage, stomach 
lavage, etc., is isolated from the patient, and is put in 
contact with the diagnostic device according to the 
invention for the diagnosis or the monitoring of an injury 
or a disease, preferably a lung injury or an oxidative 

20 stress-related disorder, affected by the presence of pro- 
oxidant agent or oxidativ^e stress such as specific cardio- 
vascular diseases like arteriosclerosis, neurodegenerative 
disorders (Alzheimer's disease, Parkinson's disease, 
amyotrophic lateral sclerosis), apoptosis, inflammatory 

25 reactions, allergic reactions such as asthma, hay fever and 
eczema, high bone mass syndrome , osteopetrosis , 
osteoporosis-pseudoglioma syndrome , and Bardet-Biedl 
syndrome 1. Said diagnosis and monitoring upon one or more 
biological samples obtained from several tissues from the 

30 patient can be advantageously obtained by one or more of 
the methods above -described, which could be adapted 
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10 



according to the specific biological sample by the person 
skilled in the art . 



invention could be used as a marker for the above- 
5 identified injuries, diseases or disorders in a broad 
spectrum of tissues as shown in the enclosed Figure 1. 



related to a pharmaceutical composition comprising a 
pharmaceutically acceptable carrier and an element selected 
10 from the group consisting of the nucleotide sequence, the 
amino acid sequence of the polypeptide according to the 
invention, the inhibitor directed against said sequences 
and/or one or more portions thereof. 



15 related to the use of the pharmaceutical composition 
according to the invention for the manufacture of a 
medicament in the treatment and/or the prevention of lung 
injuries and/or diseases or of oxidative stress-related 
disorders . 

20 The present invention is also related to a 

prevention and/or treatment method of a patient, especially 
a human patient, preferably affected by lung injuries 
and/or diseases or by oxidative stress-related disorders, 
wherein a sufficient amount of the pharmaceutical 

25 composition according to the invention is administered to 
said patient in order to treat, avoid and/or reduce the 
symptoms of said injuries and/or diseases. 



prevented and/or treated are injuries and/or diseases 
30 affected by the presence of pro-oxidant agents or oxidative 
stress, such as specific cardio-vascular diseases like 
arteriosclerosis, neurodegenerative disorders such as 



Therefore, the product according to the 



A further aspect of the present invention is 



A last aspect of the present invention is 



Other injuries and/or diseases which can be 
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Alzheimer's disease, Parkinson's disease, amyotrophic 
lateral sclerosis, apoptosis and inflammatory reactions and 
some allergic reactions such as asthma, hay fever and 
eczema, high bone mass syndrome, osteopetrosis, 
5 osteoporosis-pseudoglioma syndrome, and Bardet-Biedl 
syndrome 1 . 

The pharmaceutically acceptable carrier 
according to the invention is any compatible non-toxic 
substance suitable for administering the composition 

10 according to the invention to a human patient. 
Pharmaceutically acceptable carriers according to the 
invention suitable for oral administration are the ones 
well known by the person skilled in the art, such as 
tablets, coated or non-coated pills, capsules, spray-gas, 

15 patches, gels, solutions or syrups. Pharmaceutically 
acceptable carriers vary according to the mode of 
administration (intravenous, intramuscular, subcutaneous , 
parenteral, etc.), and may comprise also adjuvants well 
known by the person skilled in the art to increase, reduce 

20 and/or regulate humoral, local and/or cellular response of 
the immune system. 

The pharmaceutical composition according to 
the invention may be prepared by the methods, generally 
applied by the person skilled in the art in the preparation 

25 of various pharmaceutical compositions, wherein the 
percentage of the active compound/pharmaceutically 
acceptable carrier can vary within very large ranges, only 
limited by the tolerance of the patient to said 
pharmaceutical composition, and wherein the limits are 

3 0 particularly determined by the frequency of administration 
and the possible side-effects of the active compounds or 
its pharmaceutically acceptable carrier. 
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Another aspect of the invention is related to 



the use of the diagnostic device according to the invention 
for performing upon the patient or upon a biological fluid 
obtained from the patient, a diagnosis, a dosage and/or a 
5 monitoring of the above-mentioned injuries or diseases or 
oxidative stress-related disorders affecting the patient. 



related to a cell or a non-human animal, preferably a 
mammal such as a mouse or a rat, transformed by the vector 

10 according to the invention and overexpressing the 
polypeptide according to the invention, or a non-human 
animal, preferably a mammal such as a mouse or a rat, 
genetically modified by a partial or total deletion of its 
genomic sequence encoding the polypeptide according to the 

15 invention {knock-out non-human mammal) and obtained by 
methods well known by the person skilled in the art such as 
the one described by Kahn et al . {Cell, Vol. 92, pp. 593- 
596 (March 1998) ) . 



20 human animals according to the invention may be a 



according to the invention, preferably an antisense nucleic 
acid sequence complementary to the nucleotide sequence 
according to the invention so placed as to be transcribed 
25 into antisense mRNA which is complementary to the 
nucleotide sequence according to the invention and which 
hybridises to said nucleotide sequence, thereby reducing or 
blocking its translation. 



30 be described in the enclosed non- limiting examples in 
reference to the following Figures. 



A further aspect of the present invention is 



Other examples of genetically modified non- 



transgenic non-human ajnimal comprising an inhibitor 



Further aspects of the present invention will 
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5 Figure 2 



10 Figure 3 



15 



20 



Brief description of the drawings 

Figure 1 represents a dot blot analysis of mRNA 

encoding the polypeptide according to the 
invention in various types of human tissues, 
represents a Northern blot analysis of mRNA 
encoding the polypeptide according to the 
invention in a rat lung after administration 
of lipopolysaccharides (LPS) inducing an 
inflammatory reaction of the lung, 
represents a Northern blot analysis of mRNA 
encoding the polypeptide according to the 
invention in a rat lung after intraperitoneal 
injection of pneumotoxicants . 
Figure 4 is a schematic representation of the human 

genomic sequence, the complete cDNA sequence 
and the corresponding amino acid sequence. 
Figure 5 represents respectively the alignment of the 

sequences of the human B18 polypeptide 
according to the invention with the 
corresponding rat and mouse sequences. 



Example 1 ; Homology between the B18 polypeptide 
according to the invention with other known 
nucleotide or amino acid sequences 

25 The BLAST 2.0 software (gapped BLAST at the 

NCBI Internet site) was used for searching for homologies 
between human B18 (162 amino acids) and known polypeptides 
in databases (GenBank, SwissProt) . Said search did not give 
perfect alignment with known peptides from different 

30 species (Table 1) . Homologies of the human B18 cDNA (805 
nucleotides) with GenBank, EMBL, DDBJ and PDB deposited 
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nucleotide sequences {Table 2) and GenBank Expression 
Sequence TAGS (ESTs) were noted. 
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Table 1 ; Homologies of the B18 proteins (162 amino 

acid) with other proteins 



Name 


NCBI ID 


Identity {%) 
Homology (%) 


Membrane protein 
(synechocystis sp . ) 


1652859 


57/129 (44%) 
81/129 (62%) 


Peroxisomal - like protein 
( Aspergi 1 lus f umigatus ) 


2769700 


56/176 (31%) 
90/176 (50%) 


Haein HI 0572 hypothetical 
protein (Haemophilus 
influenzae) 


1723174 


53/146 (36%) 
80/146 (54%) 


PMP2 0 (Schizosaccharomyces 
Domhe ) 


AJ002536 


54/161 (33%) 
85 /161 (52%) 


Peroxi somal membrane 
protein A (PMP 20) (Candida 

Irioi dinii ) 


130360 


59/170 (34%) 
89/170 (51%) 


Per oxi soma 1 membrane 
protein B (PMP 20) (Candida 
boidinii) 


130361 


58/170 (34%) 
88/170 (51%) 


Put at ive peroxi somal 
protein PMP from yeast 
(Saccharomyces cerevisiae) 


1709682 


41/138 (29%) 
72/138 (51%) 


Al ky 1 hy dr ope r oxi de 
reductase C22 protein 
(Escherichia coli) 


P26427 


36/126 (28%) 
58/126 (45%) 



Table 2 



Name 


Access NO 


Identity 


Human mRNA down- regulated in 
cells infected by adenovirus 5 


U82616 


259/263 (98%) 


Human mRNA down- regulated in 
cells infected by adenovirus 5 


U82615 


300/321 (93%) 



5 
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In the Table 2, an identity of 98% has been 
obtained with the alignment of 259 nucleotides of CDNA B18, 
which comprises in its totality 805 nucleotides, with 263 
nucleotides of U82616 CDNA. A similar identity has been 
5 obtained with the US2615 sequence. 

The sequence SEQ ID NO 1 comprising 8 05 
nucleotides presents a homology with several EST sequences 
obtained from a human and from a mouse, having the 
following references : 

AA130751, N42215, W38597, N91311, N68467, AA187737, 
N68916, W00593, R88950, AA181884, H20154, H66666 
Mouse : 



AA220019, AA123351, AA087129, AA255021, AA249897, W71344 



15 



Example 2 ; Tissue detection 

A human RNA master Blot (Clontech) containing 
100-500 ng of poly-A + human RNA in each dot {normalised to 
the mRNA expression levels of eight different housekeeping 

20 genes) was hybridised with a 554 bp-long B18 probe labelled 
with , and quantified musing Phosphorimaging Technology. 
As shown in Figure 1, B18 mRNA is present in all tissues 
examined but predominantly in trachea, lung, kidney, 
thyroid gland, stomach, colon, heart and some regions of 

25 the brain. Highest expression has been noted in the thyroid 
tissue. This presence is probably correlated with the 
possible antioxidant activity of the B18 polypeptide 
according to the invention. 



3 0 Exajnnple 3 ; Inf lainmatory reaction 

Figure 2 represents a Northern blot analysis 
of rat lung mRNA after 6, 48 and 72 hours after 
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lipopolysaccharides (LPS) instillation inducing an 
inflammatory reaction in the lung. 

A Northern blot containing 15 fig of total RNA 
in each lane was hybridised with a 225 bp- long rat B18 
5 probe, stripped and reprobed with a 572 bp- long rat P-actin 

probe, both labelled with ^^P. Northern blot was quantified 
using Phosphorimaging Technology and the B18 mRNA data were 
normalised to P-actin mRNA level. 



10 TCyaiTtpl Q 4 ; Pneumotoxic reaction 

Figure 3 represents a Northern blot analysis 
of rat lung mRNA after intraperitoneal injection of 
pneumo toxicants (4 -ipomeanol , 1- (3-fyryl) -4-hydroxypentanone 
(IPO) , methylcyclopentadienyl manganese tricarbonyl (MMT) 

15 or alpha naphtyl thiourea (ANTU) ) , These agents are known to 
induce in the lung acute lesions of Clara (IPO) and 
alveolar cells (MMT) as well as increasing the permeability 
of the alveolar/blood barrier (ANTU) . A Northern blot 
containing 15 fxg of total RNA in each lane was hybridised 

20 with a 225 bp-long rat B18 probe, stripped and reprobed 
with a 572 bp-long p-act^in probe both labelled with ^^p^ 
The Northern blot- was quantified using Phosphorimaging 
Technology and rat B18 mRNA data were normalised to p-actin 
mRNA level . 

25 

Example 5 : Proteinic activity of the B18 polypeptide 

An amino analysis of the complete human B18 
amino acid sequence shows that said polypeptide presents 
specific portions showing an homology with other anti- 
3 0 oxidant enzymes (starting from a Leucine at position 3 6 
until a Cysteine at position 47) and an other portion 
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having an important homology with beta chains of ATP 
synthase (starting from a Glutamic acid at position 13 
until a Glycine in position 38) * 

Furthermore, the B18 amino acid sequence 
5 according to the invention shows an important homology with 
an Aspergillus fximigatus allergen (34% identity and 60% 
homology by using clustal V sequence alignment) , especially 
upon the portion of said B18 polypeptide having possible 
antioxidant properties. Therefore, it is possible that a 

10 peroxisomal protein (possibly homologous to BIB protein) is 
able to induce and to bind IgE from patients sensitised to 
Aspergillus fumigatus peroxisomal proteins after an 
induction of the patient immune system with Aspergillus 
fumigatus allergen. This mechanism can be compared to a 

15 reaction obtained with the manganese superoxide dismutase 
(MnSOD) wherein the human MnSOD is able to bind to IgE from 
patients sensitised to Aspergillus fumigatus MnSOD. 

Furthermore, the Inventors have identified a 
portion of the BIB human polypeptide which presents an 

20 homology with a Cyclophil in-binding domain of Candida 
boidinii PMP2 0 (receptor^ of the immuno- suppressant drug 
cyclosporine A) . Said possible Cyclophil in-binding domain 
is starting from the Threonine in position 150 until the 
Leucine in position 161. 

25 

Example 6 ; BIS human gene and mRNA alternative splicing 

As represented in the enclosed Figure 4 , the 
Inventors have identified upon the genomic DNA (SEQ ID NO 
10) 5 exons and 5 introns . By RT-PCR (using primers 5'- 
30 gggtatgggactagctggcg-3 ^ and 5 ' -ctggccaacattccaattgcag- 3 ' ) 
and according to the genomic sequence, 4 different cDNAs 
corresponding to the transcription of the said genomic DNA 
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have been identified in human lung and in human brain. A 
first cDNA of 73 6 bp corresponds to the cDNA encoding the 
complete amino acid sequence of the B18 protein according 
to the invention. However, 3 other cDNAs of 601, 6 04 and 
5 469 bp were also identified, and comprise specific 
splicings of one or more exons . 



invention is related to said specific portions of the 
complete genomic or CDNA nucleotide sequence according to 
10 the invention or to specific portions of the complete amino 
acid sequence of the B18 protein according to the 
invention, which could be used also as specific markers of 
the B18 activity, preferably the ant i- oxidative activity. 

15 "Rya-mpl e 7 ; Knock-out mouse 



the B18 polypeptide according to the invention have been 
deleted by homologous recombination . Said homologous 
recombination has been obtained with a genetic sequence 

20 comprising a neomycin resistant gene. The targeting vector 
with said gene and a ^thymidine kinase {in order to 
eliminate non- homologous recombinants with ganciclovir) has 
been prepared. Said recombination was used for the deletion 
of one or more exons of the B18 polypeptide. After 

25 electroporation of ES cells with the targeting vector, 
positive clones having incorporated homologous 
recombination were identified by Southern blot with 
labelled probes. Aggregation of said positive clones with a 
morula from a Swiss pseudo- pregnant mouse produces several 

3 0 chimeric mice which survive after birth. Several homozygote 
mice are obtained by cross-breeding and are used as a model 
for the above-mentioned diseases. 



Therefore, another aspect of the present 



Exons of a mouse genomic sequence encoding 
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Similar experiments may be done with another 
mammal whose B18 sequence is known {the B18 sequence of a 
mouse and a rat and their alignment with the human sequence 
is shown in the enclosed Figure 5) . 

5 

Exam ple 8 : Chromosome localisation of human B18 gene 

Radiation hybrid clones (GeneBridge 4 
Radiation Hybrid Panel, Research Genetics) were used for 
performing chromosome localisation by PGR with two 

10 different pairs of primers (5 ' -caggttcaccttgttccctggctc-3 ' 
(SEQ ID NO 14), 5' -atgttatgcaaccctttgcgacac-3 ' (SEQ ID NO 
17) and 5' -gtgtttgaaggggagccagggaac-3 ' (SEQ ID NO 18), 
5' _agagacagggtttcaccatcttgg-3' {SEQ ID NO 19)). 

The Inventors have located B18 genomic 

15 sequence on human chromosome llql3 . BIB gene has been 
located 7.15-6.1 cR from marker D11S913 between markers 
D11S1963 and D11S4407 (Genome Database internet site) . 

Unknown genes linked to different disorders 
ha^ve been localised in the same region of chromosome 11. 

20 Therefore, B18 gene is possibly associated with these 
disorders : 

- atopy (atopic hypersensitivity: asthma, hay fever and 
eczema; MIM n^ 147050 at OMIM of NCBI internet site) , 

- high bone mass syndrome (MIM n°601884) , 
25 - osteopetrosis (MIM n^259700) , 

- osteoporosis -pseudoglioma syndrome (MIM n°259770) and 

- Bardet-Biedl syndrome 1 (MIM n<=209901) . 
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CLAIMS 

1. Amino acid sequence having more than 70% 
homology with the sequence SEQ ID NO 2 , 

2- Amino acid sequence according to claim 1, 
having more than 85% homology with the sequence SEQ ID NO 
2 . 

4 

3. Amino acid sequence according to claim 1 
or 2, having more than^ 95% homology with the sBqxiencQ SEQ 
ID NO 2. 

4, Amino «icid sequence corresponding to SEQ 
ID NO 2 or a portion thereof selected from the group 
consisting of the sequences comprised between: 

- the glutamic acid in position 13 and the glutamic acid 
in position 27, 

- the alanine in position 2 6 and the leucine in position 
3 6 , 

- the alanine in position 42 and the glutamic acid in 
position 57/ 

- the glutamic acid in position 57 and the valine in 
position 69; ^ 

_ the valine in position 80 and the leucine in position 
97, 

_ ^jjg arginine in position 95 and the leucine in position 
112 , 

- the serine in position 118 and the serine in position 

129, • r 

- the valine in position 137 and the threon<?ne in position <^ 



- the glutamic acid in position 13 and the cysteine in 
position 47, 

- the glutamic acid in position 13 and the glycine in 
position 38, and 

- the leucine in position 36 and the cysteine in position 
47, 

- and the treonine in position 150 and the leucine in 

position 161. 

5. Nucleotide sequence encoding the amino 
acid sequence according to any- one of the preceding claims 
and presenting more than 70% homology with SEQ ID MO 1 or 
its complementary strand. 

6. Nucleotide sequence according to claim 5, 
having more than 8 5% homology with the sequence SEQ ID NO 1 
or its complementary strand. 

7. Nucleotide sequence according to claim 5 
more than 95% homology with the sequence SEQ ID NO 1 or its 
complementary strand. 

8. Nucleotide sequence corresponding to the 
secaience SEQ ID NO 1, its complementary strand or a portion 
cnereoi- tse— ecL-cuL ^.-uwni v-jj,^»/i_*.s^ ^w***^—*- — — — ^ — » 
SEQ ID n^8, SEQ ID n°9, SEQ ID n^ll, SEQ ID n^l2, SEQ ID 
n°13, SEQ ID n^l4, SEQ ID n^l5 and SEQ ID n<^16. 

9. Vector comprising the nucleotide sequence 
according to any one of the claims 5 to 8 . 

10. Inhibitor directed against the amino acid 
or nucleotide sequence according co any one of the claims 1 

to 8 - >. 

11. Inhiliitor according to claim 10; being an 

antibody, preferably a Monoclonal antibody, or a portion of 
said antibody. 

12. Diagnostic device c"^mprising an element 
selected from the - group consisting of the amino acid 



secruence according to any one of the claims 1 to 4, the 
nucleotide sequence according to any one of the claims 5 to 
8, the inhibitor according to claim 10 or 11, their 
portions or a mixture thereof. 

13. Method for the in vitro detection of lung 
injuries and diseases or oxidative stress -related diseases 
and disorders, especially inflammatory diseases, comprising 
the steps of : 

- isolating a sample from a body fluid of a patient, 
preferably a human patient, 

- possibly inhibiting the contaminants present in said 
sample , 

- put in contact said sample with an element selected from 
the group consisting of the amino acid sequence 
according to any one of the claims 1 to 4, the 
nucleotide sequence according to any one of the claims 5 
to 8, the inhibitor according to claim 10 or 11, their 
portions or a mixture thereof, and 

- detecting a reaction of a molecule present in said 
sample with said elemenc. 

14. Pharmaceutical composition comprising a 
pharmaceutically acceptable carrier and an element selected 
from the group consisting of the amino acid sequence 
according to any one of the claims 1 to 4 , the nucleotide 
sequence according to any one of the claims 5 to 8, the 
inhibitor according to claim 10 or 11, their portions or a 

mixture thereof. 

15. Use of the pharmaceutical composition 
according to claim 14 for the manufacture of a medicament 
for the prevention and/pr the treatment of lung injuries or 
diseases, and of oxidative stress -related diseases or 
disorders, such as specific cardio-vascular diseases like 
arteriosclerosis, neurodegenerative disorders such as 
Al zheimer * s disease , Parkinson ' s disease , amyotrophic 




lateral sclerosis, apoptosis and inflammatory reactions, 
allergic reactions such as asthma, hay fever and eczema, 
high bone mass syndrome, osteopetrosis, osteoporosis- 
pseudoglioma syndrome, and Bardet-Biedl syndrome 1, 
5 16. Cell transfoiined by the vector according 

to claim 9 or comprising a total deletion of its nucleotide 
sequence according to any one of the claims 5 to 8 . 

17 • Non-human animal, preferably a non-human 
mammal, transformed by the vector according to claim 9 or 
10 comprising a total deletion of its nucleotide sequence 
according to any one of the claims 5 to 8 . 
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SUBSTITUTE SHEET {RULE 26) 
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CLUSTAL V alignment of human and rat B18 amino acid sequences (Identity: 
90%, Homology: 97.5%): 

BlShum MAPIKVGDAIPAVEVFEGEPGNKVNLAELFKGKKGVLFGVPGAFTPGCSK = SEQIDNOI 

B18rat MAPIKVGDTIPSVEVFEGEPGKKVNLAELFKDKKGVLFGVPGAFTPGCSK 

B18hum THLPGFN^QAEALKAKGVQWACLSWDAFVTGEWGRAHKAEGKVRLIJ^ 

BlSrat THLPGFVEQAGALKAKGAQVVACLSWDVFVTAEWGRAHQAEGKVQLLAD FIG. 5a 

B18hum PTGAFGKETDLLLDDSLVSIFGNRRLKRFSMWQDGIVKALNVEPDGTGL 
BlSrat PTGAFGKETDLLLDDSLVSLFGNRRLKRFSWIDKGVVKALNVEPDGTGL 

BlShum TCSLAPNIISQL 
BlBrat TCSLAPNILSQL 

CLUSTAL V alignment of human and mouse B18 amino acid sequences (Identity: 
91%, Homology: 96%) : 

BlShum MAPIKVGDAIPAVEVFEGEPGNKVNLAELFKGKKGVLFGVPGAFTPGCSK 
BlSmouse MAPIKVGDAIPSVEVFEGEPGKKVNLAELFKGKKGVLFGVPGAFTPGCSK 



BlShum THLPGFVEQAEALKAKGVQWACLSWDAFVTGEWGRAHKAEGKVRLLAD 

BlSmouse THLPGFVEQAGALKAKGAQWACLSWDVFVTEEWGRAHQAEGKVRLLAD 

BlShum PTGAFGKETDLLLDDSLVSIFGNRRLKRFSKWQDGIVKALNVEPDGTGL 

BlSmouse PTGAFGKATDLLLDDSLVSLFGNRRLKRFSMVIDNGIVKALNVEPDGTGL 

B18h\jm TCSLAPNIISQL 

BlSmouse TCSLAPNILSQL . 

CLUSTAL V alignment of human and rat cDNA sequences (identity: 612/7 80, 
78.5%) : 

BlShxim GCCAGGAGGCGGAGTGGAAGTGGCCGTGGGGCGGGTATGGGACTAGCTGG 

B18rat TG CGTC CTAGGCAG 

B18hum CGTGTGCGCCCTGAGACGCTCAGCGGGCTATATACTCGTCGGTGGGGCCG 

B18rat CATA GCC GGA TCGGTGCTCCGTGCATCGGCTACTTGGAC — 

Bl 8h\mi GCGGTCAGTCTGCGGCAGCGGCAGCAAGACGGTGCAGTGAAGGAGAGTGG 

Bl Brat GTGCGTGGCAGGCAGAGCAGGCCGG AAAGGAGCAGGTTGG 
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BlBhum 
B18rat 



GCGTCTGGCGGGGTCCGCAGTTTCAGCAGAGCCGCTGCAGCCATGGCCCC 
GAGTGTGGTGGGGCCCGCAGCTTCAGCAGTGCCGCGGTGACTATGGCCCC 

* * -tr -k -if -ir -tr -k -k -Ik- * if -ir -^^ *r -ir ^ * -k -k k * -k -k k -k * -k k -k 



BlShum 
BlSrat 



AATCAAGGTGGGAGATGCCATCCCAGCAGTGGAGGTGTTTGAAGGGGAGC 
GATCAAGGTGGGAGACACCATTCCCTCAGTGGAGGTATTTGAAGGGGAAC 

-k^k^-k-k-k-k-k***** -k-k-kk^ -kk * k kr k -k k k * k k -kk^kr^k-kk^kk^ -k 



Biehum 
Bl8rat 



CAGGGAACAAGGTGT^ACCTGGCAGAGCTGTTCAAGGGCAAGT^GGGTGTG 
CTGGAAAGAAGGTGAACTTGGCAGAGCTGTTCAAGGACAAGAAAGGTGTT 

* k-* kr-k -k -k -k -k -k -k * -k -k -k-kkr-k-k-k-kkrk-k-kkk krk-kkrk + ★ * + it * 



BlShum 
BlSrat" 



CTGTTTGGAGTTCCTGGGGCCTTCACCCCTGGATGTTCCAAGACACACCT 
TTGTTTGGAGTCCCTGGGGCATTTACACCTGGCTGTTCCAAGACCCATCT 

★ kr -k -k -k * -k kr k- k-k k^k krkkrkk kr -k k k k k k -k -k -k k k- k- -k k 



BlShum 
B18rat 



GCCAGGGTTTGTGGAGCAGGCTGAGGCTCTGAAGGCCAAGGGAGTCCAGG 
GCCTGGGTTTGTGGAGCAAGCCGGAGCTCTGAAGGCCAAGGGAGCACAAG 

-kkr-k kc -k kr kr -k * k^ k-k kr k kr k- kr kr k 4r k- k k- k kr k k -k -k -k k kr k kr kr -k k- -k 



BlShum 
BlSrat 



TGGTGGCCTGTCTGAGTGTTAATGATGCCTTTGTGACTGGCGAGTGGGGC 
TGGTGGCCTGTCTGAGTGTTAATGATGTCTTCGTGACTGCAGAGTGGGGT 

**kfkkr^kr-kk^kckr-k*k^kr-k*k(krkrkr'k-k'k^kk -k ^ kr kr k -k kr -k k kr k- * k * k kr kc * 



BlShum 
BlSrat 



CGAGCCCACAAGGCGGAAGGCAAGGTTCGGCTCCTGGCTGATCCCACTGG 
CGAGCCCACCAGGCAGAAGGCAAGGTTCAGCTCCTGGCTGACCCCACTGG 

k^ kr * k: kr kr k^ kr -k + ★ * tIt kr -k kr -k kr kr rk k k k k -k k k kr k kr k kr kr k -k k- -k k- + + 



BlShum 
BlSrat 



GGCCTTTGGGAAGGAGACAGACTTATTACTAGATGATTCGCTGGTGTCCA 
AGCTTTTGGAAAGGAGACAGATTTACTACTAGATGATTCTTTGGTGTCTC 

k-kr + * -k -k kr k kr kr kr k- kr k- kr kr k k -k * k- kt k- k k k- * k- k kr k k k -k -k k kc k 



BlShum 
BlSrat 



TCTTTGGGAATCGACGTCTCAAGAGGTTCTCCATGGTGGTACAGGATGGC 
TCTTTGGGAATCGTCGGCTAAAAAGGTTCTCCATGGTGATAGACAAGGGC 

k^ kr k- kr k- kr ^ kr -k kr kr ic kr kr kr kr kr k^ k kr k k- * kr -k k kr kr k' k kr k kr k -k k kr k k 



BlShum 
BlSrat 



ATAGTGAAGGCCCTGAATGTGGAACCAGATGGCACAGGCCTCACCTGCAG 
GTAGTAAAGGCACTGAACGTGGAGCCGGATGGCACAGGCCTCACCTGCAG 

★ kek^-kk^kr *krkkrkr kr kr k kr k^ kr k -k k- k k kr k k- k kr k k^ k k- kr kc kr k kr kr k^ kr kr k 



BlShum 
BlSrat 



CCTGGCACCCAATATCATCTCACAGCTCTGAGGCCCTGGGCCAGATTACT 
CCTGGCCCCCAACATCCTCTCACAACTCTGAGGCCCTGA-CCAGA — ATG 

krkrkkrkrkr kr k- kr -k kr -k kr kr kr kr kr kr kr kr k kr kr k- -k kr kr k- kr -k kr k k- k kr -k kr kr k kr 



BlShum 
BlSrat 



TCCTCCACCCCTCCCTATCTCACCTGCCCAGCCCTGTGCTGG-GGCCCTG 
TCCTCTGACTCTCCC-ATCTCCTCCACCCAGCTCTGGGCCAAAGGCCCAG 

krkrkekkr kr kf kr -k -k kr ***** * ****** *** ** ***** * 



BlShum 
BlSrat 



CA ATTGGAATG TTGGCCAGATTTCTGC 

TACCTCCTTACCTGAGGGCCACTGGAATGGAACCTTGACAATATTTCTGC 
* * ******* *** * * +*^k.4.** + ^ 



BlShum 
BlSrat 



AATAAACACTTGTGGTTTGCGGAAAAAAA 

AATAAACAGTT-TAATTTGTG7W\AAAAAAAAA7W\AAAAAAA 
******** ** * ^^^^ + + ^ + ^ 
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CLUSTAL V alignment of human and mouse cDNA sequences (Identity: 552/675, 
81.8%) : 

FIG.5C 

BlShum GCCAGGAGGCGGAGTGGAAGTGGCCGTGGGGCGGGTATGGGACTAGCTGG 
B18mouse 



B18hum 
Bl Bmouse 



CGTGTGCGCCCTGAGACGCTCAGCGGGCTATATACTCGTCGGTGGGGCCG 
TGCTCCGTG CATCGACGTGCTTG 



BlShuiti 
BlSmouse 



GCGGTCAGTCTGCGGCAGCGGCAGCAAGACGGTGCAGTGAAGGAGAGTGG 
GCAGGCAG AGCAGGCCGG AAAGAAGCAGGTTGG 



B18hum 
BlSmouse 



GCGTCTGGCGGGGTCCGCAGTTTCAGCAGAGCCGCTGCAGCCATGGCCCC 
GAGTGTGGCGGAGCCCGCAGCTTCAGCAGCTCCGCGGTGACCATGGCCCC 



B18hum 
B18mouse 



AATCAAGGTGGGAGATGCCATCCCAGCAGTGGAGGTGTTTGAAGGGGAGC 
GATCAAGGTGGGAGATGCCATTCCCTCAGTGGAGGTATTTGAAGGGGAAC 



BlBhum 
B18mouse 



CAGGGAACAAGGTGAACCTGGCAGAGCTGTTCAAGGGCAAGAAGGGTGTG 
CGGGAAAGAAGGTGAACTTGGCAGAGCTGTTCAAGGGCAAGAAAGGTGTT 



BlShum 
BlBmouse 



CTGTTTGGAGTTCCTGGGGCCTTCACCCCTGGATGTTCCAAGACACACCT 
TTGTTTGGAGTCCCTGGGGCATTTACACCTGGCTGTTCTAAGACCCACCT 



BlShum 
B18mouse 



GCCAGGGTTTGTGGAGCAGGCTGAGGCTCTGAAGGCCAAGGGAGTCCAGG 
GCCTGGGTTTGTGGAGCAAGCTGGAGCTCTGAAGGCTAAGGGAGCGCAGG 



BlBhum 
BlSmouse 



TGGTGGCCTGTCTGAGTGTTAATGATGCCTTTGTGACTGGCGAGTGGGGC 
T GGT GG C CT GT CT GAGC GTT AAT GAC GT CTTT GT GATT GAAGAGT GGGGT 



BlShum 
BlSmouse 



CGAGCCCACAAGGCGGAAGGCAAGGTTCGGCTCCTGGCTGATCCCACTGG 
CGAGCCCACCAGGCAGAAGGCAAGGTTCGGCTCCTGGCTGACCCCACTGG 



BlShum 
BlBmouse 

BlShum 
BlBmouse 



GGCCTTTGGGAAGGAGkcAGACTTATTACTAGATGATTCGCTGGTGTCCA 
AGCCTTTGGGAAGGCGACAGACTTATTATTGGATGATTCTTTGGTGTCTC 

TCTTTGGGAATCGACGTCTCAAGAGGTTCTCCATGGTGGTACAGGATGGC 
TCTTTGGGAATCGTCGGCTGAAAAGGTTCTCCATGGTGATAGACAACGGC 



B18hum 
BlSmouse 



ATAGTGAAGGCCCTGAATGTGGAACCAGATGGCACAGGCCTCACCTGCAG 
ATAGTGAAGGCACTGAACGTGGAGCCAGATGGCACAGGCCTCACCTGCAG 
+ ***** ************, 



BlShum 
BlSmouse 



CCTGGCACCCAATATCATCTCACAGCTCTGAGGCCCTGGGCCAGATTACT 
CCTGGCCCCCAACATCCTCTCCCAACTCTGAGGCCCTGG-CCAGATG— 



p<«rw* 



BlShum 
BlSmouse 



TCCTCCACCCCTCCCTATCTCACCTGCCCAGCCCTGTGCTGGGGCCCTGC 
TCCTCTGACTCTCCC-ATCTCTCCCACCCGGCTCT AGGCC 



B18h\2m 
BlSmouse 



AATTGGAATGTTGGCCAGATTTCTGCAATAAACACTTGTGGTTTGCGGAA 
AAAAGGCTCGGTACCTCCTTACTGGGAGC-CACGT 
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SEQUENCE LISTING 



{!) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: UNI VERS ITE CATHOLIQUE DE LOUVAIN 

Halles Universitaires 

(B) STREET: Place de 1' Universite, 1 

(C) CITY: LOUVAIN-LA-NEXrv:B 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : B-1348 

(A) NAME: UNIVERSITE DE MONS-HAINAUT 

(B) STREET: Place du Pare 20 

(C) CITY: MONS 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : B-7000 

(ii) TITLE OF INVENTION: PEROXISOME- ASSOCIATED PEPTIDE, NUCLEOTIDE 
SEQUENCE ENCODING SAID PEPTIDE^ AND THEIR USES IN THE 
DIAGNOSTIC AND/OR THE TREATMENT OF LUNG INJURIES AND 
DISEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS 

(iiil NUMBER OF SEQUENCES: 19 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 193, .681 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GCCAGGAGGC GGAGTGGAAG TGGCCGTGGG GCGGGTATGG GACTAGCTGG CGTGTGCGCC 60 
CTGAGACGCT CAGCGGGCTA TATACTCGTC GGTGGGGCCG GCGGTCAGTC TGCGGCAGCG 120 
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GCAGCAAGAC GGTGCAGTGA AGGAGAGTGG GCGTCTGGCG GGGTCCGCAG TTTCAGCAGA 18 0 

GCCGCTGCAG CC ATG GCC CCA ATC AAG GTG GGA GAT GCC ATC CCA GCA 228 
Met Ala Pro lie Lys Vai Gly Asp Ala lie Pro Ala 
15 10 

GTG GAG GTG TTT GAA GGG GAG CCA GGG AAC AAG GTG AAC CTG GCA GAG 27 6 

Val Glu Val Phe Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu 
15 20 25 

CTG TTC AAG GGC AAG AAG GGT GTG CTG TTT GGA GTT CCT GGG GCC TTC 324 
Leu Phe Lys Gly Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe 
30 35 40 

ACC CCT GGA TGT TCC AAG ACA CAC CTG CCA GGG TTT GTG GAG CAG GCT 372 
Thr Pro Gly Cys Ser Lys Thr His Leu Pro Gly Phe Val Glu Gin Ala 
45 50 55 60 

GAG GCT CTG AAG GCC AAG GGA GTC CAG GTG GTG GCC TGT CTG AGT GTT 420 
Glu Ala Leu Lys Ala Lys Gly Val Gin Val Val Ala Cys Leu Ser Val 
65 70 t 75 

AAT GAT GCC TTT GTG ACT GGC GAG TGG GGC CGA GCC CAC AAG GCG GAA 4 68 

Asn Asp Ala Phe Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu 
80 85 90 

GGC AAG GTT CGG CTC CTG GCT GAT CCC ACT GGG GCC TTT GGG AAG GAG 516 
Gly Lys Val Arg Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu 
95 100 105 

ACA GAC TTA TTA CTA GAT GAT TCG CTG GTG TCC ATC TTT GGG AAT CGA 5 64 

Thr Asp Leu Leu Leu Asp Asp Ser Leu Val Ser He Phe Gly Asn Arg 
110 115 120 

CGT CTC AAG AGG TTC TCC ATG GTG GTA CAG GAT GGC ATA GTG AAG GCC 612 
Arg Leu Lys Arg Phe Ser Met Val Val Gin Asp Gly He Val Lys Ala 
125 130 135 140 

CTG AAT GTG GAA CCA GAT GGC ACA GGC CTC ACC TGC AGC CTG GCA CCC 
Leu Asn Val Glu Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro 
145 150 155 



660 



AAT ATC ATC TCA CAG CTC TGA GGCCCTGGGC CAGATTACTT CCTCCACCCC 711 
Asn He He Ser Gin Leu * 
160 

TCCCTATCTC ACCTGCCCAG CCCTGTGCTG GGGCCCTGCA ATTGGAATGT TGGCCAGATT 771 

TCTGCAATAA ACACTTGTGG TTTGCGGAAA AAAA 8 05 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 163 ainino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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Met Ala Pro He Lys Val Gly Asp Ala He Pro Ala Val Glu Val Phe 
15 10 15 

Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu Leu Phe Lys Gly 
20 25 30 

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys 
35 40 45 

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gin Ala Glu Ala Leu Lys 
50 55 60 

Ala Lys Gly Val Gin Val Val Ala Cys Leu Ser Val Asn Asp Ala Phe 
65 70 75 80 

Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu Gly Lys Val Arg 
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser He Phe Gly Asn Arg Arg Leu Lys Arg 
115 120 125 

Phe Ser Met Val Val Gin Asp Gly lie Val Lys Ala Leu Asn Val Glu 
130 135 140 

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn He He Ser 
145 150 155 160 

Gin Leu * 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 136. . 624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGCGTCCTAG GCAGCATAGC CGGATCGGTG CTCCGTGCAT CGGCTACTTG GACGTGCGTG 60 
GCAGGCAGAG CAGGCCGGAA AGGAGCAGGT TGGGAGTGTG GTGGGGCCCG CAGCTTCAGC 120 
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AGTGCCGCGG TGACTATGGC CCCGATCAAG GTGGGAGACA CCATTCCCTC AGTGGAGGTA 18 0 

TTTGRAGGGG AACCTGGAAA GAAGGTGAAC TTGGCAGAGC TGTTCAAGGA CAAGAAAGGT 24 0 

GTTTTGTTTG GAGTCCCTGG GGCATTTACA CCTGGCTGTT CCAAGACCCA TCTGCCTGGG 300 

TTTGTGGAGC AAGCCGGAGC TCYGAAGGCC APiGGGAGCAC AAGTGGTGGC CTGTCTGAGT 360 

GTTAATGATG YCTTCGTGAC TGCAGAGTGG GGTCGAGCCC ACCAGGCAGA AGGCAAGGTT 420 

CAGCTCCTGG CTGACCCCAC TGGAGCTTTT GGAAAGGAGA CAGATTTACT ACTAGATGAT 48 0 

TCTTTGGTGT CTCTCTTTGG GAATCGTCGG CTAAAAAGGT TCTCCATGGT GATAGACAAG 54 0 

GGCGTAGTAA AGGCACTGAA CGTGGAGCCG GATGGCACAG GCCTCACCTG CAGCCTGGCC 60 0 

CCCAACATCC TCTCACAACT CTGAGGCCCT GACCAGAATG TCCTCTGACT CTCCCATCTC 660 

CTCCACCCAG CTCTGGGCCA AAGGCCCAGT ACCTCCTTAC CTGAGGGCCA CTGGAATGGA 72 0 

ACCTTGACAA TATTTCTGCA ATAAA.CAGTT TAATTTGTGfiL AAAAAAAAAA AAAAAAAAAA 780 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /product- "Glu or Gly" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 63 

(D) OTHER INFORMATION :/product= "Leu or Pro" 

(ix) FEATURE; 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 79 

(D) OTHER INFORMATION: /product- "Ala or Vai" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 4: 



Met Ala Pro lie Lys Val Gly Asp Thr He Pro Ser Val Glu Val Phe 
15 10 15 



wo 99/09054 PCT/BE98/00124 

5 

Xaa Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Giu Leu Phe Lys Asp 
20 25 30 

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys 
35 40 45 

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gin Ala Gly Ala Xaa Lys 
50 55 60 

Ala Lys Gly Ala Gin Val Val Ala Cys Leu Ser Val Asn Asp Xaa Phe 
65 70 75 80 

Val Thr Ala Glu Trp Gly Arg Ala His Gin Ala Glu Gly Lys Val Gin 
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg 
115 120 125 

Phe Ser Met Val lie Asp Lys Gly Val Val Lys Ala Leu Asn Val Glu 
130 135 140 

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn lie Leu Ser 
145 150 155 160 

Gin Leu 



(2) INFORMATION FOR SEQ ID NO: 5: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 99. .588 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGCTCCGTGC ATCGACGTGC TTGGCAGGCA GAGCAGGCCG GAAAGAAGCA GGTTGGGAGT 60 

GTGGCGGAGC CCGCAGCTTC AGCAGCTCCG CGGTGACCAT GGCCCCGATC AAGGTGGGAG 120 

ATGCCATTCC CTCAGTGGAG GTATTTGAAG GGG2\ACCGGG AAAGAAGGTG AACTTGGCAG 180 

AGCTGTTCAA GGGCAAGAAA GGTGTTTTGT TTGGAGTCCC TGGGGCATTT ACACCTGGCT 2 40 
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GTTCTAAGAC CCACCTGCCT GGGTTTGTGG AGCAAGCTGG AGCTCTGAAG GCTAAGGGAG 3 00 

CGCAGGTGGT GGCCTGTCTG AGCGTTAATG ACGTCTTTGT GATTGAAGAG TGGGGTCGAG 360 

CCCACCAGGC AGAAGGCAAG GTTCGGCTCC TGGCTGACCC CACTGGAGCC TTTGGGAAGG 420 

CGACAGACTT ATTATTGGAT GATTCTTTGG TGTCTCTCTT TGGGAATCGT CGGCTGAT^AA 4 80 

GGTTCTCCAT GGTGATAGAC AACGGCATAG TGAAGGCACT GAACGTGGAG CCAGATGGCA 54 0 

CAGGCCTCAC CTGCAGCCTG GCCCCCAACA TCCTCTCCCA ACTCTGAGGC CCTGGCCAGA 600 

TGTCCTCTGA CTCTCCCATC TCTCCCACCC GGCTCTAGGC CAAAAGGCTC GGTACCTCCT 660 

TACTGGGAGC CACGT 675 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Pro He Lys Val Gly Asp Ala He Pro Ser Val Glu Val Phe 

15 10 15 

Glu Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Gly 
20 25 30 

Lys Lys Gly Vai Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys 
35 40 45 

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gin Ala Gly Ala Leu Lys 
50 55 60 

Ala Lys Gly Ala Gin Val Val Ala Cys Leu Ser Val Asn Asp Val Phe 
65 70 75 80 

Val He Glu Glu Trp Gly Arg Ala His Gin Ala Glu Gly Lys Val Arg 
85 90 95 

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Ala Thr Asp Leu Leu 
100 105 110 

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg 
115 120 125 
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Phe Ser Met Val lie Asp Asn Gly lie Val Lys Ala Leu Asn Val Glu 
130 135 140 



Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn lie Leu Ser 
145 150 155 160 



Gin Leu 



(2) INFOFIMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

{ix) FEATURE: 

{A} NAME/ KEY: CDS 

(B) LOCATION: 161. ,382 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTTCG 180 

GCTCCTGGCT GATCCCACTG GGGCCTTTGG GAAGGAGACA GACTTATTAC TAGATGATTC 240 

GCTGGTGTCC ATCTTTGGGA ATCGACGTCT CAAGAGGTTC TCCATGGTGG TACAGGATGG 300 

CATAGTGAAG GCCCTGAATG TGGAACCAGA TGGCACAGGC CTCACCTGCA GCCTGGCACC 360 

CAATATCATC TCACAGCTCT GAGGCCCTGG GCCAGATTAC TTCCTCCACC CCTCCCTATC 420 

TCACCTGCCC AGCCGTGTGC TGGGGCCCTG CAATTGGAAT GTTGGCCAG 4 69 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 161* .514 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60 

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 12 0 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCJ^ TCAAGACACA 180 

CCTGCCAGGG TTTGTGGAGC AGGCTGAGGC TCTGAAGGCC AAGGGAGTCC AGGTGGTGGC 24 0 

f 

CTGTCTGAGT GTTAATGATG CCTTTGTGAC TGGCGAGTGG GGCCGAGCCC ACAAGGCGGA 300 

AGGCAAGGTT CGGCTCCTGG CTGATCCCAC TGGGGCCTTT GGGAAGGAGA CAGACTTATT 360 

ACTAGATGAT TCGCTGGTGT CCATCTTTGG GAATCGACGT CTCAAGAGGT TCTCCATGGT 420 

GGTACAGGAT GGCATAGTGA AGGCCCTGAA TGTGGAACCA GATGGCACAG GCCTCACCTG 480 

CAGCCTGGCA CCCAATATCA TCTCACAGCT CTGAGGCCCT GGGCCAGATT ACTTCCTCCA 54 0 

CCCCTCCCTA TCTCACCTGC CCAGCCCTGT GCTGGGGCCC TGCAATTGGA ATGTTGGCCA 600 

G 601 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION:161. .517 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 



60 
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TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120 

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTGGG 180 

AGATGCCATC CCAGCAGTGG AGGTGTTTGA AGGGGAGCCA GGGAA.CAAGG TGAACCTGGC 240 

AGAGCTGTTC AAGGGCAAGA AGGGTGTGCT GTTTGGAGTT CCTGGGGCCT TCACCCCTGG 300 

ATGTTCCAAG GTTCGGCTCC TGGCTGATCC CACTGGGGCC TTTGGGAAGG AGACAGACTT 360 

ATTACTAGAT GATTCGCTGG TGTCCATCTT TGGGAATCGA CGTCTCAAGA GGTTCTCCAT 420 

GGTGGTACAG GATGGCATAG TGAAGGCCCT GAATGTGGAA CCAGATGGCA CAGGCCTCAC 4 80 

CTGCAGCCTG GCACCCAATA TCATCTCACA GCTCTGAGGC CCTGGGCCAG ATTACTTCCT 54 0 

CCACCCCTCC CTATCTCACC TGCCCAGCCC TGTGCTGGGG CCCTGCAATT GGAATGTTGG 600 

CCAG 604 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHTIRACTERISTICS : 

(A) LENGTH: 2710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2516. .2710 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2 07 4. .2135 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1932. .1970 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1728 . .1859 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 802. . 936 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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TCTGTCCCTT AGCGCCCCCG CGGGGGCTTA CCCCATCCCA CTCCATGACC TCCCCTCCCC 60 

CCATGGCGAA TTCCCACCTT TCTGTCTTTC ACTCACTTCC TGGAACCGTC CCCAGGGCCT 12 0 

TGGACCTTCC CCCTTCTCCT CCCAAACCTT GTGAGACCCC ATTCCCTTTC TACTTCATCC 18 0 

TGCTCTCAAC TTTTGGGCTC CTCAGAGGCC CTCACCCCTG ACTCTCTCTC CCTACCCACT 24 0 

CTGGTCCCAT GAAGCCCTCA AGTACTCTGG GGATGGATCC TTCCCCCTTC AAAAGATTCC 300 

TTCTTTTGTT CTACACCTCC TGGGTGTAGG GGCCTGGACA CCCTCCCCCA ACGTTCCACC 360 

TGCCGCTGCC CTTCCTCTTC CTCCTCCTGA GGGTGGGACC CTCAGACCTG GCCAAGATCC 420 

TCTCCCTCCA TGTTGTCAGG GACTCCTCCT CACCCCCAAA TACAGCCCTC TAGCCCCTGT 480 

CCATTTTATT CCACTCCTTT CCTGTAACCT AGACAGCATG TTATGCAACC CTTTGCGACA 540 

CATGGGGAAA CCTTCCCTCC CTTCCTCTGT TGTCACCT^T GGCCCCTTAA GAGGAGCAGG 600 

GCCACCTTGA AACTTGGAGG ATATGGGGTA ACCCAGTGcb AGCGGGCAGG GAGGGCCCTT 660 

GGAAACTGAC AGGGCTGGAG TATCCTGCTG GGTTTCAGCC CCGGTTCCTG CAGGCACAGC 720 

TGCCAGGCTC TCTGTTCACC TTCCTGCCTC TGGTTTGCCC CGGCTCCCTC ACCCCCCTTA 780 

CCCTGGAGTC CTTCCTTCTA GGTGGGAGAT GCCATCCCAG CAGTGGAGGT GTTTGAAGGG 84 0 

GAGCCAGGGA ACAAGGTGAA CCTGGCAGAG CTGTTCAAGG GCAAGAAGGG TGTGCTGTTT 900 

GGAGTTCCTG GGGCCTTCAC CCCTGGATGT TCCAAGGTGA GGCCCTTCCC CTTCTGAAGA 960 

TCAGGACCTG GGGATCTTTT GTGTTGCTCT TAAGTCCTCC ACATAGTCCT GATAGGACTC 1020 

CTAAAAAGCA TTTCAGTGCC ATCACAAAAC AAGTAGAGCT GGGTAGAGCT GGGCGCGGTG 108 0 

GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCAAGGCG GGTGGATCAC GAGGTCAGGA 114 0 

GTCCAAAACC AGCCTGGCCA AGATGGTGAA ACCCTGTCTC TACTTWW^T GCAAAAAAAT 120 0 

CAGCCGGATA TGGTGGCGGG CGCCTGTAAT CCCAGGTATT GGGGAGGCTG AGGCAGAGAA 12 60 

TTGCTTGAAC CCAGGAGGCG TAGGTTGCAG TGAGTGGAGA TCGTGCCTCT GCAGTCCAGC 1320 

CTGGGTGAAA GAGCGAGACT CCGTCTCAAA ATGAAAAAAA AAAAAGAAAA CAAGTAGAGA 138 0 

CTGCAAAAAG GGAACAGTAC CGGGAATGTT GGAGAAAAAC ATACTACAAT TAAATCCAAC 144 0 

ACCCCTGTTG GTCCTGCTAA ATGACAGGCA CTGTGGAAGG TGCTTGGGAC TCAGATAAAT 1500 

AAGACAAAGA TCTGCCCATG GAAAGTTCAC GTCTGGACCA TAAGGCATTA GGTTTCATTC 15 60 

TGAGCTTCCT AGTGGCCAAG GCAAAAAGGA AATAGAATGG TTTAGACAGC TCTCATTGTC 1620 

TGATCAAAGG TGTTGAGGCA GAGCACTGAG GAGGGCCTGG AGATAAAGGG TGGGCTGGGG 168 0 

GTCAGATGCA GTTATCCCTT TGCCGACCCT TTGTTCCCCT TCCTCAGACA CACCTGCCAG 174 0 

GGTTTGTGGA GCAGGCTGAG GCTCTGAAGG CCAAGGGAGT CCAGGTGGTG GCCTGTCTGA 1800 

GTGTTAATGA TGCCTTTGTG ACTGGCGAGT GGGGCCGAGC CCACAAGGCG GAAGGCAAGG 18 60 
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TGAGGTGAGG GGCCTGCAGG GAGTCAGGAC CAGGTAGGAT ATTCTTCTTG TGACCTCTAC 1920 

TTTCTCTGCA GGTTCGGCTC CTGGCTGATC CCACTGGGGC CTTTGGGAAG GTGAGTGTTC 198 0 

CCCTGACCGC CACAGGGACA TGGCGGTGCG GGGAGCAGTG GGGGCCCTTG GCCTCTTCAA 204 0 

GGATTTCTGA CACTTTTCTC TGTCTCTTCT TAGGAGACAG ACTTATTACT AGATGATTCG 2100 

CTGGTGTCCA TCTTTGGGAA TCGACGTCTC AAGAGGTAAA AGTGGAGAGT CCTCTGTGGA 2160 

GAAAGTCCTC TGTGGGAGAG AGTCCTCTGT GGGAGAGAGT CCTCTGTGGA GAGGGTCCTC 2220 

TGTGGGAAGA GTCGTCTGTG GGGGAGATGT GTGGGAGAGA GTCCTGTGTG GGGAGAGTCT 228 0 

TCTGTAGGGG AGAGTCCTCT GGGGAGAGAG TCCTGTGTGG GGGAGAGTCC TCTGTGGGGA 2340 

GAGTCCTCTG TGTGGAGAGA GTCCTGTGTG GTGGTGAGTC CTCTGTGGGG GAGAGTCCTC 24 00 

TGTGGGGGGA GTCCTCTCTG GAGTTCTCTT GGGCCCCTGG CTGTTCACTG CCTGTCTCCA 24 60 

TGCCCAGCCT CCAAGCCCAG GCTGATGCAG CTGGCTGGGC CCCTCTTTCC GGCAGGTTCT 252 0 

CCATGGTGGT ACAGGATGGC ATAGTGAAGG CCCTGAATGT GGAACCAGAT GGCACAGGCC 258 0 

TCACCTGCAG CCTGGCACCC AATATCATCT CACAGCTCTG AGGCCCTGGG CCAGATTACT 264 0 

TCCTCCACCC CTCCCTATCT CACCTGCCCA GCCCTGTGCT GGGGCCCTGC AATTGGAATG 2700 

TTGGCCAGAT 2710 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCATCCCAG CAGTGGAGGT GTTTG 
(2) INFORMATION FOR SEQ ID NO: 12: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTGAACAGCT CTGCCAGGTT CACC 2 4 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGAGGTGTT TGAAGGGGAG CCAG 24 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAGGTTCACC TTGTTCCCTG GCTC 2 4 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGGTATGGGA CTAGCTGGCG 20 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CTGGCCAACA TTCCAATTGC AG 



22 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGTTATGCA ACCCTTTGCG ACAC 24 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 18: 
GTGTTTGAAG GGGAGCCAGG GAAC 24 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGAGACAGGG TTTCACCATC TTGG 



24 
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